Publication

Back to overview

FUn: a framework for interactive visualizations of large, high-dimensional datasets on the web

Type of publication Peer-reviewed
Publikationsform Original article (peer-reviewed)
Author Probst Daniel, Reymond Jean-Louis,
Project Exploiting and Extending GDB for Drug Discovery
Show all

Original article (peer-reviewed)

Journal Bioinformatics
Volume (Issue) 34(8)
Page(s) 1433 - 1435
Title of proceedings Bioinformatics
DOI 10.1093/bioinformatics/btx760

Open Access

Abstract

Motivation During the past decade, big data have become a major tool in scientific endeavors. Although statistical methods and algorithms are well-suited for analyzing and summarizing enormous amounts of data, the results do not allow for a visual inspection of the entire data. Current scientific software, including R packages and Python libraries such as ggplot2, matplotlib and plot.ly, do not support interactive visualizations of datasets exceeding 100 000 data points on the web. Other solutions enable the web-based visualization of big data only through data reduction or statistical representations. However, recent hardware developments, especially advancements in graphical processing units, allow for the rendering of millions of data points on a wide range of consumer hardware such as laptops, tablets and mobile phones. Similar to the challenges and opportunities brought to virtually every scientific field by big data, both the visualization of and interaction with copious amounts of data are both demanding and hold great promise. Results Here we present FUn, a framework consisting of a client (Faerun) and server (Underdark) module, facilitating the creation of web-based, interactive 3D visualizations of large datasets, enabling record level visual inspection. We also introduce a reference implementation providing access to SureChEMBL, a database containing patent information on more than 17 million chemical compounds. Availability and implementation The source code and the most recent builds of Faerun and Underdark, Lore.js and the data preprocessing toolchain used in the reference implementation, are available on the project website (http://doc.gdb.tools/fun/).
-