Introducing G2.ai, the future of software buying.Try now
Diffbot
Show rating breakdown
Save to My Lists
Claimed
Claimed

Top Rated Diffbot Alternatives

Clearbit
(626)
4.4 out of 5
ZoomInfo Sales
(8,759)
4.5 out of 5

Diffbot Reviews & Product Details - Page 2

Diffbot Overview

What is Diffbot?

Diffbot provides a suite of products built to turn unstructured data from across the web into structured, contextual databases. Diffbot's products are built off of cutting-edge machine vision and natural language processing software that's able to read billions of documents every day. Diffbot Knowledge Graph Diffbot's Knowledge Graph product is the world's largest contextual database comprised of over 10 billion entities including organizations, products, articles, events, and more. Knowledge Graph's innovative NLP and fact parsing technologies link up entities into contextual databases, incorporating over 1 trillion "facts" from across the web in nearly live time.

Diffbot Details
Languages Supported
English
Show LessShow More
Product Description

Automatic data extraction from articles, products, discussions and more.


Seller

Diffbot

Description

Diffbot is a cutting-edge AI company that specializes in extracting structured data from the web. Utilizing advanced machine learning and computer vision techniques, Diffbot's technology automatically turns unstructured web content into actionable data, capable of powering a wide variety of applications from market intelligence to business analytics.Diffbot provides its services through an API suite that includes the Knowledge Graph, Crawlbot, and various extraction APIs. These tools are designed to automate data collection and analysis, helping businesses to understand vast amounts of web data with minimal manual intervention. The Knowledge Graph, one of Diffbot's flagship offerings, integrates data from billions of web pages to create a comprehensive, continuously-updated database of global knowledge.

Overview Provided by:
CEO at Diffbot

Recent Diffbot Reviews

JW
Justin W.Mid-Market (51-1000 emp.)
4.0 out of 5
"The most Competant Web Crawling Service I've used"
Overall, Diffbot's tools are simple to use and understand outside of more complex use cases. We use several of their features to deliver content in...
KL
Kurt L.Small-Business (50 or fewer emp.)
5.0 out of 5
"Diffbot is a game-changer."
Diffbot makes the difficult task of managing data and extracting useful information much easier. They provide access to a seemingly infinite amount...
Verified User
A
Verified UserSmall-Business (50 or fewer emp.)
4.5 out of 5
"Diffbot Increases Efficiency"
Prior to using Diffbot, we relied primarily on RSS feeds and a web scraping tool that is based on the visual layout and HTML of a webpage. We were ...

Diffbot Media

Diffbot Demo - Knowledge Graph Product View
Diffbot's Knowledge Graph provides billions of product, article, organization, people, and other entity types with fields populated by our AI-enabled web extraction tech.
Diffbot Demo - Enhance Excel Integration
Diffbot Enhance provides data enrichment on organizations and people of interest. With over 127 million organizational entries from Diffbot's Knowledge Graph, you can enrich data profiles from minimal data with ease.
Answer a few questions to help the Diffbot community
Have you used Diffbot before?
Yes

29 Diffbot Reviews

4.9 out of 5
The next elements are filters and will change the displayed results once they are selected.
Search reviews
Hide FiltersMore Filters
The next elements are filters and will change the displayed results once they are selected.
The next elements are filters and will change the displayed results once they are selected.
29 Diffbot Reviews
4.9 out of 5
29 Diffbot Reviews
4.9 out of 5
G2 reviews are authentic and verified.
James C.
JC
Manager, Data Team
Small-Business(50 or fewer emp.)
More Options
Validated Reviewer
Review source: Organic
What do you like best about Diffbot?

Diffbot can augment data streams for SO MANY industries/use cases. Within ours we're able to keep track of news mentions on universities (from literally all over the web), and enrich leads for outreach. I'm sure there's a ton more we could be doing with Diffbot. But even with those uses the service has paid for itself many times over. It doesn't take many saved work hours to justify the $299 price tag... Review collected by and hosted on G2.com.

What do you dislike about Diffbot?

To tap into the full power of Diffbots offerings you do need a technical team member. (But for what service is this not the case?) Basically you can deal with pre-extracted sites (of which there seem to be millions) with the Knowledge Graph and Enhance. If you want to crawl a specific site repeatedly you'll need to at least know hot to make an API call. Review collected by and hosted on G2.com.

What problems is Diffbot solving and how is that benefiting you?

High level we're using Diffbot for data extraction. More specifically enriching lead data and monitoring news sources about a large group of organizations.

In the past we've built custom scrapers. but even with a (albeit small) data team the upkeep required to monitor even scores of sites made projects balloon in complexity and cost. The fact that we have multiple entry points to data streams about web properties that matter to us is HUGE. Review collected by and hosted on G2.com.

BE
Mid-Market(51-1000 emp.)
More Options
Validated Reviewer
Review source: Organic
What do you like best about Diffbot?

Diffbot's Extraction APIs and Crawlbot API provide an incredibly valuable, versatile, and simple to use pipeline for acquiring crucial information from web pages that may not have been visited before. The Analyze API makes it a snap to determine if the page in question is a product page or not, and the wide array of elements that Diffbot returns from most pages is exceptionally useful! Review collected by and hosted on G2.com.

What do you dislike about Diffbot?

In our space, we tend to cover a large percentage of the e-commerce world, and that takes us to many domains that are either irregular, outdated, or less than perfect in terms of function. We've noticed that for those pages, or ones with domains that have sophisticated/aggressive bot blocking techniques that Diffbot will often fail to provide a result (or at least within a minute or two). This can be problematic for a company like ours that explores tens of thousands of domains each day as it can slow down our discovery pipeline that finds new listings and e-commerce domains. Review collected by and hosted on G2.com.

What problems is Diffbot solving and how is that benefiting you?

We typically use Diffbot to aid in providing data elements that we need in machine learning and AI, but would be too costly to spend the human-hours creating selectors for. Additionally, we use the Crawlbot API to help us get wider coverage of certain sites, while still leveraging the power of the automated extraction tools that Diffbot offers. Review collected by and hosted on G2.com.

Eric S.
ES
Chief Scientist
Small-Business(50 or fewer emp.)
More Options
Validated Reviewer
Review source: Organic
What do you like best about Diffbot?

We needed a content sourcing solution for our product, Tanjo Animated Personas, or TAPs. Tanjo Animated personas are simulated customers that learn and evolve over time. Our personas need to read a continual stream of articles, in order to evolve and function properly. Diffbot gives us an easy way to source that content.

We have been a Diffbot customer for over 5 years, and have used all of their products, including Crawlbot and Knowledge Graph. Before Diffbot, we mainly relied on RSS feeds and custom scrapers to import articles into our system. The results were often inconsistent, with misread or malformed text blocks. It was tedious and unsustainable. Diffbot provided an almost limitless set of sources with high quality data.

Implementing Diffbot has greatly improved scalability, efficiency and quality of feeding internet articles into our platform. They are always willing to work with us if we encounter any issues. They take customer feedback seriously and are willing to hear out suggestions for what features could be improved or added. We appreciate Diffbot’s flexibility to work with us for our needs. Review collected by and hosted on G2.com.

What do you dislike about Diffbot?

Diffbot has always been open to hearing our suggestions for what could be improved or added to their website. I don't think it would be fair to "dislike" anything since they have taken our feedback seriously in the past and iterated on their platform. If we think things could be better, we let Diffbot know. Review collected by and hosted on G2.com.

What problems is Diffbot solving and how is that benefiting you?

We needed an automated method to extract article text and images from popular websites online that was much more reliable and required much less effort to maintain. Diffbot provides an almost limitless set of sources with high quality data. Review collected by and hosted on G2.com.

Verified User in Internet
UI
Mid-Market(51-1000 emp.)
More Options
Validated Reviewer
Verified Current User
Review source: Organic
What do you like best about Diffbot?

Their support team is very helpful. Even without purchasing their support plan to have an SLA, they usually get back within a week and provide thorough responses. Sometimes, they'll even see your API configuration, adjust it for you, and explain how the new setting is better.

I would highly recommend Diffbot for their robust and dependable products, supportive sales and customer support staff, and transparent pricing plans. Even their base plans make it easy for any company or team of any size to test it and determine what their positive ROI looks like. Review collected by and hosted on G2.com.

What do you dislike about Diffbot?

Documentation could be improved a bit. It can be hard for new users who aren't familiar with HTML and CSS how to apply specific filters and selectors. My recommendation here is to provide templates or additional documentation on best practices for scraping data from popular sources such as Wikipedia.

Another small thing they can improve on is providing better visibility into account usage statistics for accounts with multiple tokens, which are all tied into one parent account. Review collected by and hosted on G2.com.

What problems is Diffbot solving and how is that benefiting you?

Their data extraction APIs are customizable and flexible. Almost any page on the internet can be scraped. It expedites data extraction for our team as we don't need to depend on custom python scripts or software engineers to help collect data for our needs. We were able to reduce time from days to mere hours to get working APIs to extract data. For a startup that is now part of a much larger company, this type of efficiency helped us allocate our engineers to more important sprints. Review collected by and hosted on G2.com.

Henry V.
HV
Small-Business(50 or fewer emp.)
More Options
Validated Reviewer
Review source: Organic
What do you like best about Diffbot?

Diffbot provides a simple, well documented API that allows for mind-boggling web scraping with brain-dead code. By finding what's important on nearly every kind of webpage, Diffbot helped launch my project further than I could have imagined, saving me hours writing code which would have only been able to understand a few websites. Review collected by and hosted on G2.com.

What do you dislike about Diffbot?

One suggestion for them is, there are probably individuals/small businesses out there that can't afford the plans they offer, that could still get a lot out of Diffbot, so maybe they should consider adding a smaller plan. But as a user I haven't encountered anything to dislike yet- really! Haven't had a single issue using the API and it was really easy to get started with all of their help. Review collected by and hosted on G2.com.

What problems is Diffbot solving and how is that benefiting you?

Several times a day, we're scraping URLs which are dynamically chosen by a program and pulling data from those web pages. Since we don't know which sites will be scraped in advance, it's a daunting programming task to reliably scrape the important info from any given webpage. Diffbot does this job reliably with any web page we encounter. Ultimately it gives us a ton of mental space to tackle other important aspects of my program, rather than muck around in the mess of web code. Review collected by and hosted on G2.com.

Verified User in Defense & Space
UD
Small-Business(50 or fewer emp.)
More Options
Validated Reviewer
Verified Current User
Review source: Organic
What do you like best about Diffbot?

Diffbot is powerful and simple to use. Users from basic to advanced levels of technical expertise can use Diffbot and extract content from the web with ease. Diffbot is highly scale-able because it is so easy to extract content from the web. The pricing is better than other software we have used before. The customer support has been superb. We almost always receive responses from the support team within 24 hours after their submission. The support team works hard to give timely and accurate suggestions and fixes for issues we face. The onboarding process was very smooth. Diffbot provided us with a generous trial amount that really allowed us to evaluate Diffbot and see that it was the right solution for us. The user interface is simple and sleek. Many tasks on Diffbot can be automated making management of hundreds of crawlers or other extraction APIs fairly effortless. Diffbot has been everything we hoped for in web extraction. Review collected by and hosted on G2.com.

What do you dislike about Diffbot?

Monitoring the success of crawlers is challenging since there are not notifications on whether a crawler has not been delivering for a while or meeting a lot of errors. Review collected by and hosted on G2.com.

What problems is Diffbot solving and how is that benefiting you?

Ingesting open source news content from hundreds of sites. Diffbot automates a huge portion of the web scraping process and it is highly accurate with the option to adjust the ingestion fields for specific needs or for nonstandard site formats. With the service we used before switching to Diffbot, it would take about a half hour to create an extractor for a website, but with Diffbot we were able to do a site it in about 5-10 minutes. We have not taken advantage of the Knowledge Graph very much yet since we have been focused on the extraction solution. But, that also looks promising for our use case. Review collected by and hosted on G2.com.

Andres P.
AP
Mid-Market(51-1000 emp.)
More Options
Validated Reviewer
Verified Current User
Review source: Organic
What do you like best about Diffbot?

We have used Diffbot for several years, their API for text extraction is extremely powerful and accurate. It has become an important part of our data processing pipeline. Their API(s) allow us to convert unstructured HTML data into information we can ingest and store.

Their support is also very responsive and has always provide us with value answers and feedback when needed. Review collected by and hosted on G2.com.

What do you dislike about Diffbot?

They also provide with a web interface to define custom rules, that functionality has also proved very useful, however its UI can be not very intuitive sometimes. Review collected by and hosted on G2.com.

What problems is Diffbot solving and how is that benefiting you?

It allows us to extract structured data from HTML pages. Review collected by and hosted on G2.com.

Verified User in Venture Capital & Private Equity
UV
Small-Business(50 or fewer emp.)
More Options
Validated Reviewer
Review source: Organic
What do you like best about Diffbot?

1) Enrichment data

2) Ability to query data in aggregate Review collected by and hosted on G2.com.

What do you dislike about Diffbot?

1) Being charged based on entities

2) Being charged as we go (I wish there was a way to limit my queries) Review collected by and hosted on G2.com.

What problems is Diffbot solving and how is that benefiting you?

Lead enrichment

Lead sourcing

Customer profiling Review collected by and hosted on G2.com.

AR
Small-Business(50 or fewer emp.)
More Options
Validated Reviewer
Verified Current User
Review source: Organic
What do you like best about Diffbot?

We're a happy customer for about 6 years now, and we tend to forget Diffbot is there, since their data flows seaminglessly. Our work depends a lot on data processing, and we don't want to worry about how data sources provide their data, or when change their process along the way. With Diffbot we can really focus on processing. Review collected by and hosted on G2.com.

What do you dislike about Diffbot?

Nothing worth mentioning. The few glitches we had in the past were promptly dealt by their support. Review collected by and hosted on G2.com.

What problems is Diffbot solving and how is that benefiting you?

We're using data extraction APIs for getting web data. We're evaluating the knowledge graph. Review collected by and hosted on G2.com.

Laura L.
LL
Small-Business(50 or fewer emp.)
More Options
Validated Reviewer
Verified Current User
Review source: Organic
What do you like best about Diffbot?

There are mulitple ways to "extract" data with Diffbot. We use the Knowledge Graph (which doesn't really require any knowledge of extraction or web scraping on our end) for exploratory analysis. And for more redudent scrapes the automatic extraction API. Solid documentation and the knowledge graph works right out of the box. Review collected by and hosted on G2.com.

What do you dislike about Diffbot?

There is a bit of a learning curve with DQL Review collected by and hosted on G2.com.

What problems is Diffbot solving and how is that benefiting you?

The ability to pull in data about brands, products, and news mentions to see trends.

We were able to ditch rule-based scrapers (that only worked some of the time). Review collected by and hosted on G2.com.