Incorporating the Data Lake into Your Analytics Architecture
Joe Caserta
President
Caserta Concepts
www.casertaconcepts.com
 


 

Wednesday, November 4, 2015
08:30 AM - 09:20 AM

Level:  Introductory


As data continues to grow in volume and variety and arrives in the enterprise at unprecedented velocity, companies look for solutions to capture the information and maximize its value. An increasingly popular part of that solution is the data lake: a large, easily accessible landing place that holds massive volumes of structured and unstructured data.

The data lake emerged because enterprises needed to capture and use new types of data. But as this wave of "big data" kept coming, companies that had worked for years to create a valuable enterprise data warehouse began to realize that data lakes could complement what was already in place. If the data lake and the enterprise data warehouse are each allowed to do what they do best, they can work together for great gain.

Businesses that successfully combine the capabilities of a data lake with existing data warehouses are poised to break new ground and realize enhanced business value. In this session, you will:

  • Understand the distinct benefits of the data warehouse and the data lake and examine how the combination of the two offers exponential gain
  • Discuss key considerations before getting started (i.e., what to keep in the ETL environment and what to move to the data lake)
  • Learn about technology solutions that exist today to facilitate the integration of the two
  • Look at the developing solutions coming to market


Joe Caserta is a recognized big data strategy consultant, author and educator, and president of Caserta Concepts, an award-winning New York-based innovation technology consulting and implementation firm founded in 2001. His firm creates tailored data strategies and uses existing and emerging technologies in big data analytics, data warehousing and business intelligence to help clients maximize data value. Caserta's industry focus areas include banking and financial services, healthcare, higher education, AdTech, and eCommerce, among others. Joe is co-author of the industry best-selling book, "The Data Warehouse ETL Toolkit" (Wiley 2004), a contributor to industry publications, and a frequent speaker and expert panelist at industry conferences and events. Joe also serves on the advisory boards of financial and technical institutions and is the organizer and host of the Big Data Warehousing Meetup Group in NYC.


   