LinkedIn has made Kafka Monitor, a framework for monitoring and testing Kafka deployments, open source, and it's now available on Github. It continuously monitors SLAs in production clusters and runs regression tests in test clusters.
You would think statisticians and big data scientists would be brothers in arms, or at least amicable team members. In some cases, they are. But in others, not so much. The good news is that it now looks as if the two disciplines are attempting to make peace.
A few days ago, 23andMe announced a new ResearchKit module researchers can use to integrate genetic information into their app-based studies. Mount Sinai and LifeMap's Asthma Health app and Stanford Medicine's MyHeart Counts app, both the first to use the module, were immediately released. While the genetic data is limited to 23andMe's 1.2 million plus customers, it's an important step in easily incorporating genetic data in app-based studies – indeed, it's a harbinger of the future of clinical research.
According to the CDC, the annual flu season is between October and May and it commonly peaks between December and February. This year, there's a late flu outbreak, and in January, Sickweather's Nightingale, an emerging AI forecasting engine, accurately forecast its appearance in March. And here it is, right on cue.
POPULAR COMMENT THREADS
If you have an entrepreneurial streak, big data skills, and a startup dream, here's your chance to make it all happen. Go ahead, look at the sky and the stars and dream a big dream. Just be sure to also get your entry in by the deadline!
Alpine Data announced Chorus 6, an upgrade of its integrated analytics platform which adds collaboration and governance capabilities to machine learning projects for both business users and data scientists. It's designed to reduce the friction between humans regarding big data projects and to focus the work beyond the never-ending quest for the perfect algorithm.
Cray, a supercomputing giant, launched its Cray Urika-GX system, a platform that combines supercomputing with an open, enterprise-ready software framework for big data analytics. Think of it as the speed and scale of supercomputers with the handiness of an appliance and the flexibility and reliability of open-source. The product is aimed at conquering cluster and application sprawl in large enterprises.
Confluent launched Kafka Streams, a lightweight product for stream processing development. It's part of the now generally released open source Confluent Platform 3.0 which also features Confluent's first commercial product, the Confluent Control Center, for managing Kafka Clusters.
KPMG, an audit, tax and advisory firm, released a report profiling fraudsters and measuring the success in using data and analytics in detecting fraud. While proactive analytics are often a good way to detect fraud, KPMG found they were not the primary detection means in any North American frauds, and used to detect only 3 percent of fraudsters worldwide.
From Our Sister Sites
On Thursday the jury in Oracle's $9.3 billion copyright lawsuit against Google unanimously found that Google was in the right when it used Java technology in its Android operating system. Privacy advocacy group the Electronic Frontier Foundation lauded the jury's decision.
Microsoft and Facebook are teaming up to build what could be the most powerful transatlantic fiber optic cable yet, The Next Web said, all in an effort to make the web more reliable for everyone.