New Internship: Building Realtime data pipelines with streamsets

Big Industries' blog

Internship_Big_Industries.jpg

Big Industries is the foremost one-stop advanced systems integration partner for Hadoop and NoSQL in Benelux. Our client base covers telco, financial services, public sector, transport & logistics, media and pharmaceuticals, and we are continuously looking for ways to reinvent, improve and refine our service offer.

Therefore Big Industries is now looking for motivated interns to help us to prepare and build high quality reference architectures for use as accelerator templates for customer projects.

The objective of the internship will be to help us to create reusable architecture blueprints that can help us rapidly build successful, high quality and robust customer implementations time and again.

The following design blueprints should be developed, deployed, secured, demonstration integrations built, the solution should be stress and soak as well as functionally tested, validated and documented (including constraints, limitations and lessons learned):

Apache_Hadoop_Logo.jpgApache_Hive_Logo.png   Apache_Impala_Logo.png      Apache_Cassandra_Logo.pngApache_Kudu_Logo.jpg StreamSets-logo.png

  • Hadoop and Hive based data warehouse cluster with near realtime continuous data ingestion using StreamSet
  • Hadoop and Impala/Kudu based data warehouse cluster with near realtime continuous data ingestion using StreamSets
  • Cassandra based data warehouse cluster with near realtime continuous data ingestion using StreamSets

Interns are expected to be self-starters able to manage a small project, with an appetite for BI, data integration and architecture; and will gain exposure to industry leading enterprise and open source data integration, data warehousing and data visualization technologies.

 

Interested in this Internship?

 

Posted by Matthias Vallaey on Oct 3, 2016 2:50:35 PM

Matthias Vallaey

About the Author

Matthias is founder of Big Industries and a Big Data Evangelist. He has a strong track record in the IT-Services and Software Industry, working across many verticals. He is highly skilled at developing account relationships by bringing innovative solutions that exceeds customer expectations. In his role as Entrepreneur he is building partnerships with Big Data Vendors and introduces their technology where they bring most value.