Project

Back to overview

Infrastructures for Community-Based Data Management

English title Infrastructures for Community-Based Data Management
Applicant Cudré-Mauroux Philippe
Number 128459
Funding scheme SNSF Professorships
Research institution Département d'Informatique Université de Fribourg
Institution of higher education University of Fribourg - FR
Main discipline Information Technology
Start/End 01.09.2010 - 31.08.2014
Approved amount 1'600'000.00
Show all

Keywords (5)

Web Data Management; Decentralized Data Integration; Data Management Systems; social data management; web 3.0

Lay Summary (English)

Lead
Lay summary
Online communities are increasingly interested in publishing, querying, manipulating, and arbitrary integrating very-large data sets to their own ends. Unfortunately, the current data management infrastructures at their disposal only provide very limited and rather inefficient support to their needs. The present project focuses on designing new distributed data management infrastructures to process and a posteriori combine such community-based data. Two related research problems are tackled. The first problem revolves around the design and implementation of an efficient storage platform to natively store, query, and distribute very large amounts of community-based data. The second problem focuses on new abstraction mechanisms to expose structured data along with pertinent meta-data to heterogeneous social communities.
Direct link to Lay Summary Last update: 21.02.2013

Responsible applicant and co-applicants

Employees

Publications

Publication
Effective named entity recognition for idiosyncratic web collections
Prokofyev Roman, Demartini Gianluca, Cudre-Mauroux Philippe (2014), Effective named entity recognition for idiosyncratic web collections, in World Wide Web Conference, Seoul, Korea.
TripleProv: efficient processing of lineage queries in a native RDF store
Wylot Marcin, Cudre-Mauroux Philippe, Groth Paul (2014), TripleProv: efficient processing of lineage queries in a native RDF store, in World Wide Web Conference, Seoul, Korea.
A Crowd-Based Data Gathering and Management System for Noise Level Data
Wisniewski Mariusz Demartini Gianluca Malatras Apostolos and Cudré-Mauroux Philippe (2013), A Crowd-Based Data Gathering and Management System for Noise Level Data, in MobiWIS.
Large-Scale Linked Data Integration Using Probabilistic Reasoning and Crowdsourcing
Demartini Gianluca Difallah Djellel Eddine and Cudré-Mauroux Philippe (2013), Large-Scale Linked Data Integration Using Probabilistic Reasoning and Crowdsourcing, in The VLDB Journal, 22(5), 665-687.
NoSQL Databases for RDF: An Empirical Evaluation
Cudre-Mauroux Philippe Enchev Iliya Fundatureanu Sever Groth Paul Haque Albert Harth Andre (2013), NoSQL Databases for RDF: An Empirical Evaluation, in ISWC.
Ontology-Based Word Sense Disambiguation for Scientific Literature
Prokofyev Roman Demartini Gianluca Boyarsky Alexey Ruchayskiy Oleg and Cudré-Mauroux Philippe (2013), Ontology-Based Word Sense Disambiguation for Scientific Literature, in ECIR.
Pick-a-crowd: tell me what you like, and I'll tell you what to do
Difallah Djellel Eddine Demartini Gianluca Cudré-Mauroux Philippe (2013), Pick-a-crowd: tell me what you like, and I'll tell you what to do, in WWW.
TRank: Ranking Entity Types Using the Web of Data
Tonon Alberto Catasta Michele Demartini Gianluca Cudre-Mauroux Philippe and Aberer Karl (2013), TRank: Ranking Entity Types Using the Web of Data, in ISWC.
An overview of HYRISE - a Main Memory Hybrid Storage Engine
Grund Martin, Cudre-Mauroux Philippe, Kruger Jens, Madden Samuel, Plattner Hasso (2012), An overview of HYRISE - a Main Memory Hybrid Storage Engine, in IEEE Data Eng. Bull., 35(1), 52-57.
Benchmarking OLTP/web databases in the cloud: the OLTP-bench framework
Curino Carlo Difallah Djellel Eddine Pavlo Andrew Cudré-Mauroux Philippe (2012), Benchmarking OLTP/web databases in the cloud: the OLTP-bench framework, in CloudDB.
Combining inverted indices and structured search for ad-hoc object retrieval
Tonon Alberto, Demartini Gianluca, Cudre-Mauroux Philippe (2012), Combining inverted indices and structured search for ad-hoc object retrieval, in SIGIR 2012.
Efficient Versioning for Scientific Array Databases
Seering Adam, Cudre-Mauroux Philippe, Madden Samuel, Stonebraker Michael (2012), Efficient Versioning for Scientific Array Databases, in ICDE 2012.
Large-scale Linked Data Processing - Cloud Computing to the Rescue?
Hausenblas Michael, Grossman Robert, Harth Andreas, Cudre-Mauroux Philippe (2012), Large-scale Linked Data Processing - Cloud Computing to the Rescue?, in CLOSER 2012.
Mechanical Cheat: Spamming Schemes and Adversarial Techniques on Crowdsourcing Platforms
Difallah Djellel Eddine, Demartini Gianluca, Cudre-Mauroux Philippe (2012), Mechanical Cheat: Spamming Schemes and Adversarial Techniques on Crowdsourcing Platforms, in CrowdSearch 2012.
Semantic Web Meets Computational Intelligence: State of the Art and Perspectives [Review Article]
Chen Huajun, Wu Zhaohui, Cudre-Mauroux Philippe (2012), Semantic Web Meets Computational Intelligence: State of the Art and Perspectives [Review Article], in IEEE Comp. Int. Mag., 7(2), 67-74.
Special Issue on Semantic Web Meets Computational Intelligence [Guest Editorial]
Chen Huajun, Cudre-Mauroux Philippe (2012), Special Issue on Semantic Web Meets Computational Intelligence [Guest Editorial], in IEEE Comp. Int. Mag., 7(2), 14-15.
ZenCrowd: leveraging probabilistic reasoning and crowdsourcing techniques for large-scale entity linking
Demartini Gianluca, Difallah Djellel Eddine, Cudre-Mauroux Philippe (2012), ZenCrowd: leveraging probabilistic reasoning and crowdsourcing techniques for large-scale entity linking, in WWW 2012.
HYRISE - A Main Memory Hybrid Storage Engine
Grund Martin, Krüger Jens, Plattner Hasso, Zeier Alexander, Cudre-Mauroux Philippe, Madden Samuel (2010), HYRISE - A Main Memory Hybrid Storage Engine, in PVLDB, 4(2), 105-116.
A Demonstration of DNS^3: a Semantic-Aware DNS Service
Philippe Cudre-Mauroux, Demartini Gianluca, Difallah Djellel Eddine, Elsayed Mostafa Ahmed, Russo Vincenzo, Thomas Matthew, A Demonstration of DNS^3: a Semantic-Aware DNS Service, in ISWC 2011, ISWC 2011, Germany.
A Demonstration of HYRISE - A Main Memory Hybrid Storage Engine
Grund Martin, Cudre-Mauroux Philippe, Madden Samuel, A Demonstration of HYRISE - A Main Memory Hybrid Storage Engine, in PVLDB.
An Integrated Socio-Technical Crowdsourcing Platform for Accelerating Returns in eScience
Aberer Karl, Boyarsky Alexey, Cudre-Mauroux Philippe, Demartini Gianluca, Ruchayskiy Oleg, An Integrated Socio-Technical Crowdsourcing Platform for Accelerating Returns in eScience, in ISWC 2011, ISWC 2011, Germany.
BowlognaBench-Benchmarking RDF Analytics
Demartini Gianluca, Enchev Iliya, Gapany Joel, Cudre-Mauroux Philippe, BowlognaBench-Benchmarking RDF Analytics, in SIMPDA 2011, Uni.Milano, Milano.
dipLODocus[RDF]--Short and Long-Tail RDF Analytics for Massive Webs of Data
Wylot Marcin, Pont Jige, Wisniewski Mariusz, Cudre-Mauroux Philippe, dipLODocus[RDF]--Short and Long-Tail RDF Analytics for Massive Webs of Data, in ISWC 2011, ISWC 2011 (Published by Springer), Germany.
Downscaling Entity Registries for Ad-Hoc Environments
Philippe Cudré-Mauroux Gianluca Demartini Iliya Enchev Christophe Guéret and Benoit Perroud, Downscaling Entity Registries for Ad-Hoc Environments, in DOWNSCALE 2012.
Graph Data Management Systems for New Application Domains
Cudre-Mauroux Philippe, Elnikety Sameh, Graph Data Management Systems for New Application Domains, in PVLDB.
Loose Ontological Coupling and the Social Semantic Web
Philippe Cudre-Mauroux, Loose Ontological Coupling and the Social Semantic Web, in Springer Journal of Ambient Intelligence and Humanized Computing .
Loose Ontological Coupling and the Social Semantic Web
Cudre-Mauroux Philippe, Loose Ontological Coupling and the Social Semantic Web, in AWIC 2011, AWIC 2011 (Published by Springer), Germany.
OLTP-Bench: An Extensible Testbed for Benchmarking Relational Databases
Difallah Djellel Eddine Curino Carlo Pavlo Andrew and Cudré-Mauroux Philippe, OLTP-Bench: An Extensible Testbed for Benchmarking Relational Databases, in PVLDB.
Scalable Anomaly Detection for Smart City Infrastructure Networks
Difallah Djellel Eddine Cudre-Mauroux Philippe and McKenna Sean A., Scalable Anomaly Detection for Smart City Infrastructure Networks, in IEEE Internet Computing.
ScienceWISE: A Web-based Interactive Semantic Platform for Scientific Collaboration
Aberer Karl, Boyarsky Alexey, Cudre-Mauroux Philippe, Demartini Gianluca, Ruchayskiy Oleg, ScienceWISE: A Web-based Interactive Semantic Platform for Scientific Collaboration, in ISWC 2011, ISWC 2011, Germany.
The Bowlogna ontology: Fostering open curricula and agile knowledge bases for Europe's higher education landscape
Demartini Gianluca Enchev Iliya Gapany Joel and Cudré-Mauroux Philippe, The Bowlogna ontology: Fostering open curricula and agile knowledge bases for Europe's higher education landscape, in Semantic Web Journal, 4(1), 53-63.

Collaboration

Group / person Country
Types of collaboration
MIT United States of America (North America)
- in-depth/constructive exchanges on approaches, methods or results
- Publication
VeriSign EMEA Switzerland (Europe)
- in-depth/constructive exchanges on approaches, methods or results
- Publication
DERI, KIT, U. Texas, VU Amsterdam Germany (Europe)
- in-depth/constructive exchanges on approaches, methods or results
- Publication
CMU, Microsoft Research United States of America (North America)
- in-depth/constructive exchanges on approaches, methods or results
- Publication
- Exchange of personnel
Google Switzerland (Europe)
- in-depth/constructive exchanges on approaches, methods or results
- Publication
- Research Infrastructure
- Industry/business/other use-inspired collaboration
EPFL + CERN Switzerland (Europe)
- in-depth/constructive exchanges on approaches, methods or results
- Publication
IBM Research Ireland (Europe)
- in-depth/constructive exchanges on approaches, methods or results
- Publication

Scientific events



Self-organised

Title Date Place
Data Engineering Meets the Semantic Web 08.04.2013 Brisbane, Australia
International Semantic Web Conference 11.11.2012 Boston, United States of America

Awards

Title Year
Amazon AWS Research Grant, Amazon AWS Teaching Grants 2013
Google Faculty Award 2013
Computer Science 2001-2012 Award, NCCR-MICS 2012
VeriSign Inc. Internet Infrastructures Grant 2012
Best Demo Award 2011
Outrageous Idea Prize 2011
Best Mentor Award, ISWC 2010 2010

Associated projects

Number Title Start Funding scheme
147609 Crowdsourced conceptualization of complex scientific knowledge and discovery of discoveries 01.12.2013 Sinergia
153023 Infrastructures for Community-Based Data Management 01.09.2014 SNSF Professorships

Abstract

Structured data are increasingly created, queried, and manipulated by arbitrary communities of users on the Web. The present proposal focuses on designing new distributed data management infrastructures for such community-based data. Two related problems are tackled: the design of efficient storage mechanisms for community-based data, and the design of higher-level abstractions to expose the data along with pertinent meta-data to the social communities.
-