Diffbot provides a set of APIs that enable developers (Enterprise or Individual) to easily use web data in their own applications. Diffbot analyzes documents much like a human would, using a combination of computer vision and natural language processing to determine how the various parts of the document/web page fit together. The algorithm uses statistical techniques to automatically and reliably determine the structural organization of a page, independent of layout and the language of the text. It also provides a knowledge base that is claimed to perform detailed searches on 10+ billion entities. The company was the first startup funded by Stanford’s SSE Ventures.