Big data opens 170 years of chemistry
Europe's largest organization for the chemical sciences, the Royal Society of Chemistry, has accumulated scientific data since the 1840s--long before electronic storage--and big data technology from MarkLogic is making it possible to manage, publish and share the information, according to an article at InfoWorld.
Using MarkLogic's NoSQL database, the RSC was able to publish three times as many journals and four times as many articles. It also allowed for the development of new educational applications to make chemistry more accessible.
The RSC has volumes of data from other sources resulting from a merger in 1980 of The Chemical Society, The Society for Analytical Chemistry, The Royal Institute of Chemistry, and The Faraday Society. Together, they comprise more than one million images, millions of science data files, and hundreds of thousands of articles from more than 200,000 authors.
MarkLogic stores various content types as XML documents, including published papers, contracts, manuals, emails and other metadata that would not fit well in a relational database.
David Leeming, manager at the projects office for the RSC, told InfoWorld that the MarkLogic solution brings the disparate data types together as a single integrated delivery mechanism. The project began in 2010 and has resulted in a 30 percent increase in the number of visitors accessing its 500,000 journal articles and a 70 percent increase in the volume of searches on its educational websites