Apache Drill vs. Apache Spark: What’s The Right Tool for the Job?
If you’re looking to implement a big data project, you’re probably deciding whether to go with Apache Spark SQL or…
No Time to Waste! 5 Essential Features for Your Information Intelligence Solution
Strategic information analysis is one of the most important activities that your company can perform. The fruits of this labor,…
Preparing Yourself to Move to Apache Spark
IT is an industry that’s always moving. While some things will never go out of style, like the Unix development…
What Are Accumulators? A Must-Know for Apache Spark
If you’ve been using Apache Spark, then you know how awesome the Resilient Distributed Dataset (RDD) is.
A Guide to Spark Streaming – Code Examples Included
Apache Spark is great for processing large amounts of data over large clusters, but wouldn’t it be great if you…
NoSQL Databases: 4 Game-Changing Use Cases
Sure, you’ve heard about NoSQL, but is it just another technology fad that’s all hype? What can you actually do…
Is Big Data Winning or Losing?
Big data is now used everywhere. AT&T has a database of 312 terabytes, the NSA use 30 million gigabytes a…
Comparing Data Science and Analytics [INFOGRAPHIC]
Data is increasingly the new currency of enterprise—which is why companies are scrambling to tap into its potential to make…
The Apocalypse of Abundance: 5 Steps to End the Insanity of Information Overload
If you're like many of the people I know, the things you once enjoyed most about the Internet now make…
Automate the Boring But Essential Parts of Your Data Warehouse
If you are in IT and responsible for your company’s data warehouse and reporting capabilities, chances are you will identify…