It's not all about Hadoop
The Apache Hadoop Project has resulted in the biggest, most accessible and most recognizable open source database available for big data. But it's not the only one. HPCC Systems, a spinoff from LexisNexis, is ready to challenge it, according to CIO magazine.
HPCC has been the engine behind LexisNexis' $1.5 billion data-as-a-service business for about a decade. It is open-sourced under the Apache 2.0 license. And like Hadoop, it uses commodity hardware and local storage interconnected through IP networks, but it has only been available for about a year, while Hadoop has been used the last six years to become the de facto standard for big data.
Hadoop has more users and a bigger open source developer community, but CIO says "HPCC is a more mature enterprise-ready package that uses a higher-level programming language called enterprise control language (ECL) based on C++, as opposed to Hadoop's Java."
Flavio Villanustre, vice president of information security and the lead for the HPCC Systems initiative within LexisNexis, told CIO that this gives HPCC advantages in terms of ease of use, as well as backup and recovery of production.
Forrester research added that HPCC also "possesses more mission-critical functionality" than Hadoop and has the "security, recovery, audit and compliance layers that Hadoop lacks."
Proponents of HPCC say it is designed more like an SQL database and can answer direct questions versus a more step-by-step process for extracting information required by Hadoop. Hadoop supporters, such as Charles Zedlewski, vice president of products at Cloudera, told CIO that Hadoop can cater to a wider range of end users and is more scalable, and flexible and less expensive.
For more of the debate:
- see the CIO article