Open Source
Preserving Big Data to Live Forever
Preserving knowledge for generations is no easy task. Key components of this massive undertaking include decisions in technology, architecture, data storage, and data accessibility. What are steps to architect a solution to keep your own data safeguarded and accessible long-term?[read more]
Hadoop Toolbox: When to Use What
Hadoop and Big Data have almost become synonymous. But Hadoop is not just Hadoop now. Over time it has evolved into a big herd of various tools, each meant to serve a different purpose. But glued together they give you a powerpacked combo. Here's my short intro to some very useful tools.[read more]
Hadoop + Ubuntu: The Big Fat Wedding
Now, here is a treat for all you Hadoop and Ubuntu lovers. Last month, Canonical, the organization behind the Ubuntu operating system, partnered with MapR, one of the Hadoop heavyweights, in an effort to make Hadoop available as an integrated part of Ubuntu through its repositories.[read more]
Software, IP Protection, Innovation and the Apple - Samsung Verdict
From a software perspective the verdict has some interesting ramifications. Fast following, a business model that is pretty common in the software industry, gets quite a bit more risky and difficult. It's long been the practice for companies to find successful software and then add a business model twist to producing a similar product.[read more]
Technology Obsolescence
Every time a new version comes in the market the usability to the users takes a toll....If a user is never confident and never can master the tool simply because by the time she gets a hand on it the tool changes, then the user suffers under an inferiority complex – the quality of work takes a massive toll and the objective of using the tool to solve business problems falls flat - the entire investment goes for a loss.[read more]
Data Visualization's Final Frontier
What is the data visualization ||-coord method? Who does it, and how? A detailed explanation.[read more]
The Diary of a Construction Manager in Love with His Business Intelligence Solution
Project managers across all industries can greatly benefit from a business intelligence tool to make their jobs easier and to make smarter, data-driven business decisions. Find out what results Bob the construction manager encountered when using a BI tool.[read more]
The Amazon Effect: Zuora, Citrix and the Acceleration of the Cloud Economy
When Amazon started making their infrastructure and IT processes available outside the company firewall in 2006 it was a tipping point of sorts for business consumerization of the cloud. Companies of all sizes leveraged a variety of Amazon Web Services to store data, stream video, run online stores and a ton of other things. And...[read more]
What Is Hadoop? Here is a 101 with Mike Olson
Mike points out that in the old days if you had a data problem you would write a big check for a massive piece of hardware and with any money left over you would by some very expensive but powerful software from Oracle. That box with software and data became your data temple and your analysis and conclusions were done there.[read more]
Data Design Principles
Taking a practical approach to developing a well-formed enterprise data warehouse – and by that, I mean one that is accurate, efficient and productive – involves basing it on sound design principles.[read more]
Facebook’s Response to Yahoo’s Patent Lawsuit
Countersuing gives Facebook the best chance of fending off Yahoo’s lawsuit – and therefore not rewarding patent lawsuits. And signaling they are only doing so in response to Yahoo (hence might drop the suit if Yahoo does) keeps them on the right side of innovation.[read more]
SDC @ Strata - Doug Cutting on Hadoop, Apache and Open Source
The Apache Software Foundation is a non-profit organization hosting about 100 different projects. It tries to promote the development of healthy communities around projects. Quality and usage, Doug maintains, are present in healthy communities.[read more]
Ring in the New Year with New Data Products
For web-based businesses, and of course, those with a web presence (which is just about everyone) there’s a goldmine of behavioral data accessible with the right tools. The trick is getting past static web analytic reporting (bounce rates, page views, session times etc.) and going further into unlocking the rich treasure trove of machine data, text and weblogs that create “big data” insight.[read more]
The Next Generation BI Professional - Things Will Be Very Different
"The next generation business intelligence professional is not one who has experienced complex deployments, lengthy project time frames and unpredictable results dealing with a big tack of software with more bells and whistles than they really cared about. Nope, the next generation business intelligence is going to be very different in many ways. Let’s explore."[read more]
Master Data Management: Does an Effective Solution Exist?
"Organisations are spending very large amounts of money on their Master Management programmes and they want to ensure their investment is sound. However, in most large organisations, managing master data is a very complex problem that technology alone will not solve."[read more]
The moderated business community for business intelligence, predictive analytics, and data professionals.
Recommended to follow
The Predictive Analytics in the Cloud Study is complete!
Register here to access the full results of this exclsuive study on Predictive Analytics and Cloud Technology including a whitepaper, 2 webinars, multiple podcasts and more!
SmartData Collective

About Social Media Today

















“Mike, we are seeing an increase in businesses seeking specialized skills to help address challenges that arose with the era of big data. The HPCC Systems platform from LexisNexis helps to fill this gap by allowing data analysts themselves to own the complete data lifecycle. Designed by data scientists, ECL is a declarative programming language used to express data algorithms across the entire ...”
“Data variety is indeed both a challenge and an opportunity. I work for Gnip and we provide social data from a variety of sources and are constantly talking about what we call The Social Cocktail. We normalize the streams to help businesses overcome some of the challenges presented in this articles The Curse and Challenge of Data Variety section. Our customers are using multiple data sources ...”