DIADEM

Domain-centric Intelligent Automated Data Extraction Methodology

 Coordinator THE CHANCELLOR, MASTERS AND SCHOLARS OF THE UNIVERSITY OF OXFORD 

 Coordinator nationality United Kingdom [UK]
 Total cost 2˙402˙846 €
 EC contribution 2˙402˙846 €
 Programme FP7-IDEAS-ERC
Specific programme: "Ideas" implementing the Seventh Framework Programme of the European Community for research, technological development and demonstration activities (2007 to 2013)
 Call code ERC-2009-AdG
 Funding Scheme ERC-AG
 Start year 2010
 Period (year-month-day) 2010-04-01   -   2015-03-31

 Participants

# participant  country  role  EC contrib. [€] 
1    THE CHANCELLOR, MASTERS AND SCHOLARS OF THE UNIVERSITY OF OXFORD

 Organization address
address: University Offices, Wellington Square
city: OXFORD
postcode: OX1 2JD

contact info
Title: Ms.
First name: Gill
Last name: Wells
Phone: +44 1865 289800
Fax: +44 1865 289801

UK (OXFORD) hostInstitution 2˙402˙846.00
2    THE CHANCELLOR, MASTERS AND SCHOLARS OF THE UNIVERSITY OF OXFORD

 Organization address
address: University Offices, Wellington Square
city: OXFORD
postcode: OX1 2JD

contact info
Title: Prof.
First name: Georg
Last name: Gottlob
Phone: -285325
Fax: -275660

UK (OXFORD) hostInstitution 2˙402˙846.00

 Word cloud

Explore the word cloud to get a rough idea of the project.

search    structure    automatically    pages    data    website    foundations    output    real    extraction    domain    input    explore    structured    web    specialized    domains    richly    estate    site    url    logical   

 Project objective

This proposal is in the area of automated web data extraction and web data management. The aim of our project is to provide the logical, methodological, and algorithmic foundations for the knowledge-based extraction of structured data from web sites belonging to specific domains, such as estate agents, restaurants, travel agencies, car dealers, and so on. One core part of this will be a comprehensive multi-dimensional logical data model that will be used to simultaneously represent both the content of a large website, its structure, inferred user-interaction patterns and all meta-information and knowledge (factual and rule-based) that is necessary to automatically perform the desired extraction tasks.

I envision that, based on these new foundations, we will be able to build extremely powerful systems that autonomously explore websites of a given domain, understand their structure and extract and output richly structured data in formats such as XML or RDF. We aim at systems that take as input a URL of a website in a given domain, automatically explore this site and deliver as output a structured data set containing all the relevant information present on that site. As an example, imagine a system specialized in the real-estate domain, that receives as input the URL of any real-estate agent, explores the site automatically and outputs richly structured records of all properties that are currently advertised for sale or for rent on the many web pages of this site. We plan to develop and implement at least two such systems for two different domains, including the one mentioned.

The breakthrough in automatic data extraction that we are striving for would enable a quantum leap for two interrelated technologies which are the hottest next topics in web search: vertical search, that is, web search in specialized domains, and object search, that is, the search for web data objects rather than web pages.
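
To make the input/output contract in the objective more concrete, here is a minimal sketch of what "richly structured records" for the real-estate example could look like when written out as XML. It is an illustration only and not the project's data model: the `PropertyRecord` class, its field names, the `records_to_xml` helper, and the `agent.example` URLs are all hypothetical, and the sample record is hand-written rather than extracted from any site.

```python
from dataclasses import dataclass, asdict
from typing import List
import xml.etree.ElementTree as ET


@dataclass
class PropertyRecord:
    """Hypothetical record for one advertised property (field names are assumptions)."""
    title: str
    offer_type: str  # "sale" or "rent"
    price: str
    location: str
    source_url: str


def records_to_xml(site_url: str, records: List[PropertyRecord]) -> str:
    """Serialize the records found on one site into a single XML document."""
    root = ET.Element("properties", attrib={"site": site_url})
    for record in records:
        item = ET.SubElement(root, "property")
        # One child element per field, named after the dataclass field.
        for field_name, value in asdict(record).items():
            ET.SubElement(item, field_name).text = value
    return ET.tostring(root, encoding="unicode")


# Hand-written sample standing in for the result of an extraction run.
sample = [
    PropertyRecord(
        title="Two-bedroom flat",
        offer_type="rent",
        price="1200 GBP per month",
        location="Oxford",
        source_url="https://agent.example/listings/1",
    )
]
xml_output = records_to_xml("https://agent.example", sample)
print(xml_output)
```

The sketch covers only the output side. The exploration and understanding of the site, which is the actual research problem described above, is deliberately left out.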
