It's open-source and community-supported, you can build anything you want, from simple file ingestion to Kafka, S3, etc... The ability to create Process groups and isolate your workloads. The number of prebuilt processors. The flow-based programming comes...
Tracking lineage at a row level is important in data lake ingestion implementation. Can Lineage be controlled at per-row level? Batch transformation performance. Need to Benchmark. May require Kafka
New features and updates. Security in Cloud sharing
It's open-source and community-supported, you can build anything you want, from simple file ingestion to Kafka, S3, etc... The ability to create Process groups and isolate your workloads. The number of prebuilt processors. The flow-based programming comes...
New features and updates. Security in Cloud sharing
Tracking lineage at a row level is important in data lake ingestion implementation. Can Lineage be controlled at per-row level? Batch transformation performance. Need to Benchmark. May require Kafka