How to identify fake big data products
Here are a few of the product announcements made at Strata to give you an overview of the direction products and partnerships are now trending, or at least leaning.
Microsoft's idea of creating a data science marketplace takes best of show. It is the best idea in a sea of great ideas at Strata NY this year.
IPSoft's Amelia, a "learning cognitive agent," is already gaining high-level work skills that its creators say will put it, or others like it, at the top C-level position one day. And a Hong Kong-based venture capital company has already appointed an algorithm to its board of directors. Will we all work for machines soon--if we can find work at all?
"Commercial banks, credit card companies and credit bureaus have dived into big data, too, mainly for marketing and fraud protection," writes John Lippert in the Washington Post. "They've mostly left advances in the field of credit scoring to upstarts." That, it turns out, is a huge mistake for banks and the consumer credit industries.
U.S. National Institutes of Health (NIH) made multi-institute awards totaling nearly $32 million for 2014 under the NIH's Big Data to Knowledge, known as the BD2K initiative. One of the awards went to the University of Pittsburgh with Carnegie Mellon University, the Pittsburgh Supercomputing Center and Yale as partners, to the tune of $11 million.
It's not all that unusual for a General Electric (GE) business to hit the billion dollar mark, but as Quentin Hardy put it, GE's Internet of Things, or IoT, software business is "probably the fastest a GE business has hit the $1 billion mark." You can expect GE to pull in even more from this business arm because it has one heck of a great IoT strategy.
Yahoo!'s previous world record was 70 minutes using a large, open-source Hadoop cluster of 2100 machines for data processing. DataBricks, founded by the creators of Apache Spark, completed the Daytona GraySort, which is a distributed sort of 100 TB of on-disk data, in 23 minutes with 206 machines with 6,592 cores during this year's Sort Benchmark competition.
By now, you've heard all about big data being used by HR for hiring. In all likelihood, you wondered just how much value big data could be in hiring people. It might be useful in finding and identifying good candidates in highly specialized fields, maybe, but for general HR purposes?
Splunk made a big splash this week in a series of announcements that aggressively ups their big data game. Topping the list is a deal with Amazon Web Services (AWS) to integrate Hunk on Amazon Elastic Map Reduce. But of equal import are the announcements of an upgrade to the flagship platform and a new mobile tool on the heels of the BugSense acquisition.