Data Governance in hadoop environments

Big Industries' blog

 

Cloudera User Group.jpg                                  Meetup logo.png

 

Belgium Cloudera User Group Meetup

Big Industries, as main sponsor of the Belgium Cloudera User Group, organised on Wednesday February 8th, 2017 a Meetup in our offices at Cronos in Kontich with Data Governance in Hadoop Environments as central topic.

The User Group is aimed at ClouderaCustomers and users and everyone who wants to learn more about the Hadoop eco-system. We aim at combining talks about future Hadoop roadmaps with stories from the trenches.

If you would like to participate to the meetup, please register on the official Meetup Page.

Data Governance in Hadoop Environments

Agenda:

6:30 PM: Welcome and Sandwiches

7:00 PM: Effective Data Wrangling with Trifacta

Trifacta is a data preparation application that enables users to transform complex data into structured formats for analysis. 

With this tool users are able to interactively explore the content of their data and trough a process called predictive transformation, define a recipe for how the dataset should be transformed. This logic is used to define how the data is processed either on your desktop, server, cloud environment or Hadoop

Speaker: Bert Oosterhof, EMEA Field CTO at Trifacta

7:45 PM: Cloudera Navigator: Data Governance solution for Hadoop

Cloudera Navigator is a complete data governance solution for Hadoop, offering critical capabilities such as data discovery, continuous optimization, audit, lineage, metadata management, and policy enforcement. As part of Cloudera Enterprise, Cloudera Navigator is critical to enabling high-performance agile analytics, supporting continuous data architecture optimization, and meeting regulatory compliance requirements.

Speaker: Emre Sevinç: Big Data Architect, Big Industries

8:30 PM: Cloudera Optimizer Demo

Cloudera Navigator Optimizer helps optimize inefficient query workloads for best results on Apache Hadoop.

This tool profiles and analyzes the SQL text in large, complex SQL workloads so users can gain an in-depth understanding of their workloads, identify queries best-suited for Hadoop and modify them as needed for optimal efficiency on Hadoop—all via an easy-to-use web UI. 

Speaker: Wim Villano: Sales Engineer, Cloudera

9:00 PM: End of the session

The slides are available for download: Click Here 

 

Posted by Matthias Vallaey on Jan 25, 2017 1:25:20 PM

Matthias Vallaey

About the Author

Matthias is founder of Big Industries and a Big Data Evangelist. He has a strong track record in the IT-Services and Software Industry, working across many verticals. He is highly skilled at developing account relationships by bringing innovative solutions that exceeds customer expectations. In his role as Entrepreneur he is building partnerships with Big Data Vendors and introduces their technology where they bring most value.