1.The concept is still quite new. The term data lake, credited to Pentaho CTO James Dixon, has been bandied about for several years. But the idea of data lakes as corporate resources is still in its infancy, according to IDC analyst Ashish Nadkarni. A data lake is defined as a massive--and relatively cheap--storage repository, such as Hadoop, that can hold all types of data until it is needed for business analytics or data mining. A data lake holds data in its rawest form, unprocessed and ungoverned.
Pentaho - News, Features, and Slideshows
Business intelligence (BI) is frequently among the top prioroties for CIOs and finding the right software to do the job is always a challenge. Cloud-based software may be all the rage, but CIOs must still manage in-house information and make better use of it through analytics and reporting tools. The big four software companies have all made strategic investments in the BI space over recent years and the options have dimnished, but there are alternative tools popping up and snatching a lot of customers in the process. This installment of '5 open source things to watch' is all about BI that doesn't scar the annual report.