Since the birth of big data, Cloudera University has been teaching developers, administrators, analysts, and data scientists how to use big data technologies. We have taught over 50,000 folks all of the details of using technologies from Apache such as HDFS, MapReduce, Hive, Impala, Sqoop, Flume, Kafka, Core Spark, Spark SQL, Spark Streaming, and Spark … Continue reading Big Data Architecture Workshop
Whether we find ourselves shopping for the latest tech gadget or something even bigger like a new car, the one thing we rely on to make the right decision is data. Data in the form of a product’s specifications, reviews from the experts, as well as data in the form of comments and reviews from … Continue reading Compliant data freedom: Oxymoron or opportunity?
One of the most important steps for creating data visualizations is selecting which aspects, features or dimensions of the data to present—in other words, letting the data dictate the visualization. Unlike school assignments, data scientists and professionals rarely receive project that provides the same clear guidance they received as children. There is no longer a … Continue reading Data visualization playbook: Determining the right level of detail
Big data is all about Velocity, Variety and Volume, and the greatest of these is Variety. At least it causes the greatest misunderstanding. Variety, in this context, alludes to the wide variety of data sources and formats that may contain insights to help organizations to make better decisions. Everything from our existing database records of … Continue reading Big Data: The Data Variety Discussion
In today’s business world big data can be a vital competitive differentiator for organizations. Traditionally, businesses needed to limit the scope of the data that they could use to make the critical decisions for driving successful business outcomes. Big data solutions have eased many of those limitations, so organizations can look at far more data … Continue reading Building a big data center of excellence
In today’s energy industry, one of the key priorities is finding new ways to cost efficiently keep up with insatiable demands for power, while also delivering renewable energy. You must be able to predict when events will occur and make the first move. Being first to respond to customer or market events could be the … Continue reading Deliver more intelligence to intelligent energy systems
Data already is the new currency and is at the heart of everything digital. I like to repeat the adage, “Data becomes Information, becomes Knowledge, becomes Wisdom”. And “It’s all about the data”. So why do we send up probes, sensors or satellites — for the data? I’d like to touch upon some mission critical … Continue reading Big Data at NASA