Big Industries recently entered into a partnership with StreamSets. This San Francisco-headquartered company, founded in 2014, has already built up momentum with its open source offering Data Collector.
Big Industries' blog
Big Industries is the main sponsor and driving force behind the Belgian chapter of the Cloudera User Group. This is a group for Cloudera customers and anyone interested in Cloudera solutions in Belgium to network, share best practices, and exchange ideas around the Cloudera Big Data platform and eco-system.
The aim of this post is to help you getting started with creating a data pipeline using flume, kafka and spark streaming that will enable you to fetch twitter data and analyze it in hive.