StreamSets on Azure HDInsight | Data Exposed

This post has been republished via RSS; it originally appeared at: Channel 9.

Azure HDInsight is a fully-managed cloud service that makes it easy, fast, and cost-effective to process massive amounts of data. Use the most popular open-source frameworks such as Hadoop, Spark, Hive, LLAP, Kafka, Storm, R & more. Azure HDInsight enables a broad range of scenarios such as ETL, Data Warehousing, Machine Learning, IoT and more.   

StreamSets Data Collector deploys on top of Azure HDInsight application. It provides a full-featured integrated development environment (IDE) that lets you design, test, deploy, and manage any-to-any ingest pipelines that mesh stream and batch data, and include a variety of in-stream transformations - all without having to write custom code. In this video we will learn on how you can install StreamSets, ingest data from multiple sources and monitor your data pipelines

Azure HDInsight application platform: Install solutions built for the Apache Hadoop ecosystem

Install custom HDInsight applications

Try SDC from StreamSets on HDInsight

Streamsets website

Leave a Reply

Your email address will not be published. Required fields are marked *


This site uses Akismet to reduce spam. Learn how your comment data is processed.