ClearStory + Spark = Data Exploration Freedom

The release of Spark 1.0 marks a significant step in the move away from MapReduce-based big data processing. In-memory. Distributed. Scale out. Machine Learning. 100X faster on initial benchmarks, and our Spark-inside solution is evidence of the blazing speed. Data Scientists and Data Engineers are rejoicing – and drooling. Although for some users… Read more »

On the Meaning of Spark 1.0

The Spark 1.0 release marks some significant changes, and it is worth taking some time to consider just what this release means for users and developers of Spark and what they can now expect from the project. The answer to that question has clearly changed over time, and not all of that change is completely… Read more »

Fast Data Insights with Amazon Redshift & ClearStory Data (Webinar)

Webinar: Tuesday, June 3rd. Register to See Interactive Analysis Capabilities and Speed of Analysis in Action. ClearStory Data will demonstrate how, when combined with Amazon Redshift, the solution allows users to quickly bring together data from internal and external sources to uncover new insights. To accelerate visual insights, the solution eliminates costly data manipulation and… Read more »

2014: The Year of Data Intelligence for Everyone

Today, the ClearStory team is pleased to announce we’ve raised $21 million in Series B from Andreessen Horowitz, Google Ventures, DAG Ventures, Khosla Ventures and Kleiner Perkins Caufield & Byers. With this round, we are poised to scale rapidly and make ClearStory’s Data Intelligence solution available to companies large and small. Fundamentally, we believe that… Read more »

ClearStory and Databricks at the Spark Summit

To follow up on the success of the Spark Summit in San Francisco this past week, I wanted to share a great conversation I had with Reynold Xin, who is one of the co-founders of Databricks and the main author behind Shark. Here’s a discussion capturing highlights of our chat at the Summit, in which we discuss how… Read more »

Spark Summit 2013

On the eve of the very first Spark Summit, there is a lot to be excited about. The past few months have been quite eventful for the Spark community – The Apache Software Foundation accepted Spark as an official incubator project and graduated Apache Mesos to a Top-Level Project, Databricks was founded to commercialize Spark… Read more »

Introducing “Data Intelligence”

Data is the lifeblood of business. And so every company wants unfettered access to the volumes of diverse data inside their enterprise. But they also want to be able to build on the insights gained from that internal information by continuously blending in external data. Data availability is not the problem – in fact, it’s… Read more »

A New Analytic Technology Stack for Scalable, Interactive Analysis

I was thrilled to see the public announcement of Databricks last week and Spark taking off with strong support from Andreessen Horowitz. Spark, for those who haven’t heard of it yet, is an open source cluster computing framework that is designed to make data analytics fast and boasts performance 100x faster than traditional MapReduce… Read more »

Picking Startups and Leadership Teams

As someone who has helped grow a number of startup companies and who has invested in a few dozen more, I’m often asked how I go about picking companies that I think show the promise of success. It’s an important question, and not just for entrepreneurs or early investors. It’s also one that everyone who… Read more »

Discussing “Big Data” with Thomas Friedman

The New York Times “Global Forum” held in San Francisco last week by New York Times columnist Thomas Friedman gave me a chance to briefly put aside the technical discussion about “Big Data” that we in the technology community so often obsess about, and to instead contemplate the far-reaching and high-level changes big data is… Read more »