DataLift - ANR Contint project (National Research Agency) - 2010-2013
Datalift lifts raw structured data in various formats (relational databases, CSV, XML, ...) to semantic data interlinked on the Web of Data. Its goal is to develop a platform for publishing and interlinking datasets on the Web of Data. Datalift will both publish datasets from a network of partners and data providers and offer a set of tools that ease the publication process: selecting ontologies for publishing data, converting data to the appropriate format (RDF using the selected ontology), publishing the linked data, and interlinking the data with other data sources.
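The conversion step described above (raw tabular data to RDF) can be illustrated with a minimal sketch. It is not Datalift's own converter: the namespaces `BASE` and `VOCAB`, the function names, and the one-triple-per-column mapping are assumptions made purely for illustration, and a real pipeline would map columns to properties of the ontology selected for the dataset.

```python
import csv
import io
from urllib.parse import quote

# Hypothetical namespaces, chosen only for this illustration.
BASE = "http://example.org/resource/"
VOCAB = "http://example.org/vocab#"


def escape_literal(value: str) -> str:
    """Escape backslashes, quotes and line breaks for an N-Triples string literal."""
    return (
        value.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def csv_to_ntriples(csv_text: str, id_column: str) -> list[str]:
    """Emit one N-Triples line per non-empty cell, keyed by the identifier column."""
    triples = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        # Percent-encode the identifier so it is safe inside an IRI.
        subject = f"<{BASE}{quote(row[id_column], safe='')}>"
        for column, value in row.items():
            if column == id_column or not value:
                continue  # skip the identifier itself and empty cells
            predicate = f"<{VOCAB}{quote(column, safe='')}>"
            triples.append(f'{subject} {predicate} "{escape_literal(value)}" .')
    return triples


if __name__ == "__main__":
    # Made-up sample rows, not taken from any Datalift dataset.
    sample = "id,label,note\n1,Paris,capital\n2,Lyon,\n"
    for line in csv_to_ntriples(sample, "id"):
        print(line)
```

Every value is emitted as a plain string literal here. Typed literals and links to other resources would depend on the chosen ontology, which is why the platform treats ontology selection as a step before conversion.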
A catalyst for the Web of Data
Partners
- Academic partners: INRIA (National Computer Science Research Institute: EXMO team, Edelweiss team), Eurecom
- Industry partners: Mondeca, Atos Origin Integration
- Institutional partners: IGN (National Geographic Institute), INSEE (National Institute of Statistics and Economic Studies)
- Innovation partner: FING
Data providers
Other partners
Expected results
Datalift will provide a catalog of ontologies to help data providers select the ontologies relevant to the data they want to publish. This catalog will feature concept search, ontology quality metrics, and ontology similarity metrics. Datalift will also provide a data conversion suite enabling efficient semi-automatic conversion of raw data to RDF. This suite will integrate many data conversion tools and automatically select the relevant tool for the type of data source to convert. Datalift also aims to provide a suite of tools for automatic Web data interlinking, and will conduct a large-scale interlinking experiment using the data of the platform's content providers together with other datasets. An infrastructure for storing and accessing data will be provided, along with tools for navigating and interacting with the linked data produced by the platform's lifting process. By extending Semantic Web description and querying formalisms to manage licenses and rights, we expect to overcome one of the major obstacles that keep data providers from publishing their data: attaching license information to the data will allow them to retain credit for it.
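To make the interlinking idea concrete, here is a deliberately naive sketch that links resources from two datasets when their labels match after normalization. It does not describe the tools Datalift will deliver: the function names and the exact-label matching rule are assumptions for illustration, and real interlinking tools typically rely on richer similarity measures.

```python
# Standard OWL property used to state that two IRIs denote the same thing.
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def normalize(label: str) -> str:
    """Case-fold the label and collapse runs of whitespace."""
    return " ".join(label.casefold().split())


def link_by_label(source: dict[str, str], target: dict[str, str]) -> list[str]:
    """Return owl:sameAs triples for resources whose normalized labels are equal.

    Both arguments map a resource IRI to its label (a hypothetical input shape).
    """
    # Index the target dataset by normalized label for constant-time lookup.
    index: dict[str, list[str]] = {}
    for iri, label in target.items():
        index.setdefault(normalize(label), []).append(iri)

    links = []
    for iri, label in source.items():
        for match in index.get(normalize(label), []):
            links.append(f"<{iri}> <{OWL_SAME_AS}> <{match}> .")
    return links
```

Exact matching on labels yields false positives for homonyms and misses spelling variants, which is one reason a large-scale interlinking experiment is needed to evaluate the tooling.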