• New AWS public data sets and the rise of a data singularity

    Among the latest efforts on the path to life-saving cures is AWS's recent announcement of two new public data sets: The Cancer Genome Atlas and The International Cancer Genome Consortium PanCancer dataset. These are two of the world's largest collections of cancer genome data and they're both now available at no cost on AWS as part of the AWS Public Data Sets program.

Possible first-ever genomic medicine diagnostic center opens

Manys stories have covered the use of big data and genomics to find new treatments and cures for diseases. The hope within that coverage is palpable but with a distinct "maybe one day" feel. But now there comes an announcement of the opening of The Smith Family Clinic for Genomic Medicine in Huntsville, Alabama, which just may be the very first clinic to be "designed solely for providing diagnoses to patients with undiagnosed disease via the exclusive use of whole genome sequencing data."

USPTO Global Dossier Initiative launches first service

The U.S. Department of Commerce's Patent and Trademark Office launched its first service for Global Dossier Monday. The goal of the new service is to make it easier for patent applicants to "view, monitor and manage intellectual property protection around the world by providing access to the dossiers of related applications filed at participating offices," the USPTO explained in a press release.

StealthINTERCEPT new release doubles down on attack analytics

The new release of StealthINTERCEPT security software Tuesday added built-in analytics for pre-authentication failures, breached passwords, concurrent log-ins, impersonation log-ins and golden ticket attack vectors to the existing attack analytics for account hacking, brute force attacks and horizontal account movement detection.

The technical challenges LinkedIn encountered with Slideshare acquisition

LinkedIn acquired Slideshare in 2012, but as many companies have already learned the hard way, acquiring a company is the easy part. Integration of data, technologies and cultures always present daunting challenges, and LinkedIn found this acquisition to be no exception.

URX analyzes Wikipedia to train machine learning models

Using the Wikipedia website for this exercise was a perfect demonstration, as there is useful information there but it can easily be lost in the noise. According to URX, while the English Wikipedia corpus consists of 15 million pages, "only about one in three pages are considered informative enough for learning."


From Our Sister Sites


The Houston Police Department reported a drastic decrease in the clearance of sexual assault cases this year, and said its records management system is partially to blame. HPD cleared 44 percent of...


Sources familiar with the Dell-EMC merger deal said that certain EMC and VMware shareholders are demanding that Dell change the intricately orchestrated plan it set out for the biggest tech acquisition ever.