Pentaho's data integration with Storm, YARN goes live


Pentaho Data Integration, or PDI, is now natively integrated with Storm and YARN. The integration went live Feb. 11 at the Strata conference in Santa Clara, CA. This is a big deal because it lets users bypass MapReduce's batch mode and integrate streaming data in real time.

"As an M2M leader in the Internet of Everything, our wireless solutions require innovative technology to bring big data insights to business users," said Bryan Stone, Cloud Platform Lead at Synapse Wireless, in a statement to the press. "The powerful combination of Pentaho Data Integration, Storm and YARN will allow my team to immediately leverage real-time processing, without the delay of batch processing or the overhead of designing additional transformations. No doubt this advancement will have a big impact on the next generation of big data analytics."

Stone wasn't the only one applauding this announcement.

"YARN is enabling Hadoop to be used as a flexible multi-purpose data processing and analytics platform," said Matt Aslett, Research Director, Data Management and Analytics at 451 Research, in a statement to the press. "We are seeing growing interest in Hadoop not just as a platform for batch-based MapReduce but also rapid data ingestion and analysis, especially using Apache Storm. Native support of Storm and YARN from companies like Pentaho will encourage users to innovate and drive greater value from Hadoop."

Big data tools are advancing by leaps and bounds these days. It's good to see the industry finally beginning to take a more mature shape.
