Press Release  |  November 20, 2014

ClearStory Data, Databricks and Cloudera to Host Bay Area Apache Spark Meetup

Event will Focus on New Developments in Apache Spark and Key Use Cases for Apache Spark-based Analytics

MENLO PARK, CA. — November 20, 2014 – ClearStory Data, the company that’s bringing Data Intelligence to everyone, today announced that ClearStory, Databricks and Cloudera will host the next Bay Area Apache Spark Meetup. The event will take place at ClearStory HQ in Menlo Park, CA on Tuesday, December 2, 2014 at 6:00 PM – 9:00 PM PT. There’s been an overwhelming response and the event has already reached capacity.

This interactive Meetup for Bay Area Apache Spark users, enthusiasts and explorers will bring together the business and technology community to exchange ideas, inspire and learn from each other about capabilities and use cases for Apache Spark. The event will include two presentations, featuring speakers from ClearStory, DataBricks and Cloudera.

Mikhail Bautin and Mark Hamstra will discuss how ClearStory’s Apache Spark-powered data harmonization and data blending technology delivers fast, scalable and highly interactive analysis. They will also take a deep-dive into a business use case of one of their customers. Databricks’ Tathagata Das and Cloudera’s Hari Shreedharan will present about the Design of Apache Spark Streaming High Availability. Spark Streaming extends Apache Spark’s power to real-time processing of data. Das and Shreedharan will discuss the design and implementation of a solution that will bring high availability with no data loss to Spark Streaming.

Apache Spark is an open-source in-memory data analytics cluster computing framework. ClearStory was involved with Spark in the early days when it was still a project at Cal Berkley’s AMPLab. ClearStory is the first commercial data analytics platform built on top of Spark and today is one of the most visible success stories for Spark.

Who:             Mikhail Bautin and Mark Hamstra, Software Engineers, ClearStory Data
Tathagata Das, Software Engineer, Databricks
Hari Shreedharan, Software Engineer, Cloudera

What:             Bay Area Spark Meetup: ClearStory Use Case + HA Spark Streaming

When:        Tuesday, December 2, 2014 at 6:00 – 9:00 PM PT

Where:           ClearStory Data, 4300 Bohannon Drive, Suite 200, Menlo Park, CA

For more information on the meetup or to be added to the waitlist, please visit


About ClearStory Data

A Gartner Cool Vendor in Big Data for 2014, ClearStory Data is bringing next-generation Data Intelligence to everyone in order to accelerate the way businesses get answers across any number of data sources. By dramatically simplifying data access to internal and external sources, harmonizing disparate data on-the-fly, and enabling fast, collaborative exploration, ClearStory Data’s end-to-end solution includes an integrated platform and incredibly simple user application. The company is backed by Andreessen Horowitz, DAG Ventures, Google Ventures, Khosla Ventures, and Kleiner Perkins Caufield & Byers (KPCB). To keep up with how the world is becoming more Data Intelligent, visit and follow us on Twitter @ClearStoryData.

Media Contact
Carol Kimura
VP, Marketing
+1 650.322.2408

Related Resources