Introducing G2.ai, the future of software buying.Try now
Cloudera Data Flow
Show rating breakdown
Save to My Lists
Unclaimed
Unclaimed

Top Rated Cloudera Data Flow Alternatives

MATLAB
(737)
4.5 out of 5

Cloudera Data Flow Reviews & Product Details

Cloudera Data Flow Overview

What is Cloudera Data Flow?

Cloudera DataFlow (CDF), formerly Hortonworks DataFlow (HDF), is a scalable, real-time streaming analytics platform that ingests, curates, and analyzes data for key insights and immediate actionable intelligence.

Cloudera Data Flow Details
Show LessShow More
Product Description

Cloudera DataFlow (CDF), formerly Hortonworks DataFlow (HDF), is a scalable, real-time streaming analytics platform that ingests, curates, and analyzes data for key insights and immediate actionable intelligence.


Seller

Cloudera

Description

Cloudera is a service provider of enterprise-grade, global data management and analytics software solutions. The company delivers a modern platform for machine learning and analytics optimized for the cloud. Cloudera's offerings enable organizations to efficiently capture, store, process, and analyze vast amounts of data, helping them use advanced data-driven insights to drive business decisions and innovation.The company's platform is designed to work in hybrid and multi-cloud environments, providing flexibility to run a variety of workloads across different clouds and on-premises environments. It supports numerous use cases from the Edge to AI, empowering businesses to transform complex data into actionable insights.Cloudera's solutions are trusted by industries ranging from healthcare and finance to retail and telecommunications, emphasizing its commitment to security and compliance. Their comprehensive support, training, and professional services ensure that clients are well-equipped to implement and maintain robust data solutions.

Recent Cloudera Data Flow Reviews

Bidisha P.
BP
Bidisha P.Enterprise (> 1000 emp.)
3.5 out of 5
"CDF review"
Cloudera Data Flow(CDF) provides us a single platform for analysis of real time streaming data. We mostly use CFM, CEM to push agents data and Kafk...
Aditya K.
AK
Aditya K.Enterprise (> 1000 emp.)
4.0 out of 5
"Cloudera Data Flow(CDF) honest reviews"
We are leveraging Kafka of Cloudera Data flow for streaming analytics. CDF provides us real time data which is critical for producing live dashboar...
Verified User
U
Verified UserMid-Market (51-1000 emp.)
3.0 out of 5
"Close to success"
Hortonworks two main pillars are HDP (Hortonworks Data Platform) and HDP (Hortonworks Data Flow). The former applies to the infrastructure required...

Cloudera Data Flow Media

Answer a few questions to help the Cloudera Data Flow community
Have you used Cloudera Data Flow before?
Yes

3 Cloudera Data Flow Reviews

3.5 out of 5
The next elements are filters and will change the displayed results once they are selected.
Search reviews
Hide FiltersMore Filters
The next elements are filters and will change the displayed results once they are selected.
The next elements are filters and will change the displayed results once they are selected.
3 Cloudera Data Flow Reviews
3.5 out of 5
3 Cloudera Data Flow Reviews
3.5 out of 5
G2 reviews are authentic and verified.
Aditya K.
AK
Lead Software Engineer
Enterprise(> 1000 emp.)
More Options
Validated Reviewer
Verified Current User
Review source: G2 invite
Incentivized Review
Rating Updated ()
What do you like best about Cloudera Data Flow?

We are leveraging Kafka of Cloudera Data flow for streaming analytics. CDF provides us real time data which is critical for producing live dashboards and also the amount of data streaming (in petabytes) helps us to have CDF as one stop shop for live data analysis Review collected by and hosted on G2.com.

What do you dislike about Cloudera Data Flow?

Kafka of CDF although is scalable however it has a lot of lag problems and needs complex tuning. When the lag occurrs that is the current offset is more than consumer end offset, a lag in 6-7 figures can be seen that means the stale records reaches to around 1 million at times due to which the dashboard waits for latest data and it sometimes takes hours to fetch that and sometimes restart of service is also required to fix that Review collected by and hosted on G2.com.

What problems is Cloudera Data Flow solving and how is that benefiting you?

Cloudera Data flow helps us to produce and consume millions of streaming records based on which our live dashboards as well as reports are created. CDF UI helps to manage the service easily as we do not have to login to the service each time we need to make configuration changes Review collected by and hosted on G2.com.

Bidisha P.
BP
Senior Speclialist (Vendor Master Data)
Enterprise(> 1000 emp.)
More Options
Validated Reviewer
Verified Current User
Review source: G2 invite
Incentivized Review
Rating Updated ()
What do you like best about Cloudera Data Flow?

Cloudera Data Flow(CDF) provides us a single platform for analysis of real time streaming data. We mostly use CFM, CEM to push agents data and Kafka to push live data which is then consumed by spark and after cleaning the financial reports are created. Review collected by and hosted on G2.com.

What do you dislike about Cloudera Data Flow?

Kafka which was earlier a part of CDP(cloudera data platform) has been moved to CDF which makes us buy a separate subscription and hence incur more costs to the project. This was a smart move by Cloudera to make more money but surely hurts us as the service that we used along with CDP now has to be purchased as it comes under CDF umbrella Review collected by and hosted on G2.com.

What problems is Cloudera Data Flow solving and how is that benefiting you?

Reports creation

Data Analysis

Data Cleansing

Dashboards creation using Kafka messages

Intuitive UI which helps us configure and manage the services in one place Review collected by and hosted on G2.com.

Verified User in Real Estate
UR
Mid-Market(51-1000 emp.)
More Options
Validated Reviewer
Review source: G2 invite
Incentivized Review
Rating Updated ()
What do you like best about Cloudera Data Flow?

Hortonworks two main pillars are HDP (Hortonworks Data Platform) and HDP (Hortonworks Data Flow). The former applies to the infrastructure required for building and deploying a data lake, and the latter is about ingestion, in batch or realtime.

Both HDP and HDF rely entirely on opensource projects, this is a distinctive point about Hortonworks. Review collected by and hosted on G2.com.

What do you dislike about Cloudera Data Flow?

As an open source project collection, it relies strongly on community activity. You still have the option to contract premium consulting or training services.

Altough it is quickly evolving into Data Science tools availability (eg. Tensorflow incorporate in HDP 3), it can be cumbersome from a developer transitioning from a traditional IDE, into the notebook vs. datalake metaphore. Review collected by and hosted on G2.com.

Recommendations to others considering Cloudera Data Flow:

Because of its open source platform. Make sure that it has the right integration into your current data field. Review collected by and hosted on G2.com.

What problems is Cloudera Data Flow solving and how is that benefiting you?

Typically it is used as an enterprise platform. There are very few companies that use it only departmentally. It solved the business problems of maintaining a pure open source Hadoop environment. It also solves for Disaster Recovery and Security. Hadoop was not designed for Security, but with Hortonworks Ranger and Kerberos, you can implement a world class security framework. Review collected by and hosted on G2.com.

There are not enough reviews of Cloudera Data Flow for G2 to provide buying insight. Below are some alternatives with more reviews:

1
MATLAB Logo
MATLAB
4.5
(737)
MATLAB is a programming, modeling and simulation tool developed by MathWorks.
2
Google Cloud BigQuery Logo
Google Cloud BigQuery
4.5
(1,146)
Analyze Big Data in the cloud with BigQuery. Run fast, SQL-like queries against multi-terabyte datasets in seconds. Scalable and easy to use, BigQuery gives you real-time insights about your data.
3
Alteryx Logo
Alteryx
4.6
(637)
Alteryx drives transformational business outcomes through unified analytics, data science, and process automation.
4
Snowflake Logo
Snowflake
4.6
(624)
Snowflake’s platform eliminates data silos and simplifies architectures, so organizations can get more value from their data. The platform is designed as a single, unified product with automations that reduce complexity and help ensure everything “just works”. To support a wide range of workloads, it’s optimized for performance at scale no matter whether someone’s working with SQL, Python, or other languages. And it’s globally connected so organizations can securely access the most relevant content across clouds and regions, with one consistent experience.
5
Databricks Data Intelligence Platform Logo
Databricks Data Intelligence Platform
4.6
(608)
Making big data simple
6
HubSpot Operations Hub Logo
HubSpot Operations Hub
4.5
(467)
HubSpot Operations Hub allows you to keep all your contacts in 2-Way, Real Time Sync no matter if you use (Gmail/Outlook, Salesforce, Pipedrive, Constant Contact, Prosperworks, HubSpot, MailChimp or ActiveCampaign to name a few).
7
Spotfire Analytics Logo
Spotfire Analytics
4.2
(356)
Self-service data discovery. Fastest to actionable insight. Collaborative, predictive, event-driven data analysis - free from IT.
8
Teradata Vantage Logo
Teradata Vantage
4.3
(350)
The Teradata Database easily and efficiently handles complex data requirements and simplifies management of the data warehouse environment.
9
Tealium Customer Data Hub Logo
Tealium Customer Data Hub
4.3
(351)
Tealium AudienceStream™ is the market-leading Customer Data Platform, combining robust audience management and data enrichment capabilities resulting in unified customer profiles and the ability to take immediate, relevant action.
10
Qubole Logo
Qubole
4.0
(259)
Qubole delivers a Self-Service Platform for Big Data Analytics built on Amazon, Microsoft and Google Clouds
Show More