Spark Speeds Towards the Next Data Processing Revolution

A mighty flame followeth a tiny spark. — Dante Alighieri If you know anything about Apache Spark, you know that its chief claim to fame is speed. With in-memory processing, Spark promises ten- or hundred-fold improvements or more in data processing times over traditional MapReduce. Spark is also more flexible, supporting a wider variety of workloads than… Read more »

A Little Spark to Wildfire

Open Source Project Birthed at U.C. Berkeley Takes Off in the Enterprise This week the fast-growing Apache Spark community is gathering in New York City to celebrate and collaborate on one of the most popular open source projects today. Launched in U.C. Berkeley’s AMPLab in 2009, Apache Spark has begun to catch on like wildfire during the last year and a… Read more »

How An Early Bet on Apache Spark Unlocks Data-Driven Transformation

When we first went to look at the Apache Spark data processing engine in 2011, it was a little-known thing inside the University of California, Berkeley’s AMPLab. The project was known to people at the AMPLab, but not to the market at large. Now, that same Apache Spark project is enjoying a meteoric rise in… Read more »

The Rise of Data Governance Comes With the Age of Data 2.0

Back at the turn of the Millennium in the late 1990’s, organizations struggled to solve the problem of who gets access to more information, more applications and more data. As enterprises began amassing more applications and data and began making that available to employees, partners and customers, security considerations escalated and the first single sign-on… Read more »

ClearStory Data and Hortonworks: Enabling Fast-Cycle, Business-Ready Analytics and the “Must-Haves” for Secure, Governed Data Lakes

We are excited to be working with and announcing ClearStory Data’s integration with Hortonworks Data Platform (HDP) during Strata + Hadoop World 2015. This partnership with Hortonworks is significant as it brings ClearStory’s business-ready, fast-cycle, scalable analysis on Hadoop Data Lakes and specifically on the Hortonworks Data Platform (HDP). ClearStory’s integration that includes a data… Read more »

The Dead End of BI Dashboards in New Data Addiction Age

Despite big data being a “quiet star” at C.E.S., wrestling with a huge variety of data from a plethora of new devices and sensors is a big challenge ahead for an Internet of Things’ future. Watching the unveiling of all kinds of new connected devices at the annual Consumer Electronics Show, the Internet of Things… Read more »

2015: Removing Blind Spots and Bringing Data Intelligence to Everyone

In the classic words of an unlikely Big Data pundit, Sherlock Holmes: “I never guess. It’s a shocking habit that’s destructive to the logical faculty.” Ironically, “guessing” is what we do when we get presented with visualizations and charts that were put together with the best intentions to answer the business’s burning questions. That’s because… Read more »

Beyond the “Big”: Solving for Data Variety Requires New Thinking

Big Data is now mainstream at large companies, says a recent study, but many people still dislike the term. Add me to the list. Size is only one part of the Big Data challenge, and the word “big” is preventing organizations from solving for the whole equation—volume, velocity and variety. IT can help the business… Read more »

The 2+2=5 Principle and the Perils of Analytics in a Vacuum

Strategic decision making in enterprises playing in a competitive field requires collaborative information seeking (CIS). Complex situations require analysis that spans multiple sessions with multiple participants (that collectively represent the entire context) who spend time jointly exploring, evaluating, and gathering relevant information to drive conclusions and decisions. This is the core of the 2+2=5 principle…. Read more »

Ending the Era of “Look But Don’t Touch”: Introducing Interactive, Collaborative StoryBoards

We’re very excited to be at the Strata Conference + Hadoop World in New York City this week. Last year we chose this conference to announce our Data Intelligence platform to the world. This year we are unveiling a major new capability that advances data storytelling: ClearStory’s new Interactive, Collaborative StoryBoardsTM powered by Spark. But… Read more »