Is Wikipedia a latent gene ontology?

Dessì, Nicoletta;Atzori, Maurizio
2017-01-01

Abstract

Despite the significant contribution from specialized ontologies and text mining methods, the evaluation of the semantic similarity of genes remains difficult because of the complex functions in which genes are involved. A less exploited resource is Wikipedia that stores more than 10400 articles about human genes: each gene name identifies the corresponding Wikipedia page resuming gene's properties in short sentences where hyperlinks define relationships with other genes in Wikipedia. This paper evaluates the extent to which the Wikipedia can be trusted for assessing the similarity of a gene pair as the distance between their Wikipedia pages. We present a set of experiments that make use of TagMe (a powerful tool for evaluating the distance of two Wikipedia pages based on their annotations) to calculate the semantic similarity of several sets of genes on Wikipedia. Results compare well with gold standards and semantic similarity values evaluated on gene ontologies. The paper demonstrates the effectiveness of Wikipedia in recognizing functional groups of genes, the quality and the wealth of its knowledge about genes as well the accuracy of TagMe.
2017
Inglese
Proceedings - 2017 IEEE 26th International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises, WETICE 2017
978-153861758-8
Institute of Electrical and Electronics Engineers (IEEE)
164
169
6
26th IEEE International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises, WETICE 2017
Comitato scientifico
21-23 June 2017
Poznan, Poland
internazionale
scientifica
Gene relatedness; Semantic similarity; TagMe; Text mining; Wikipedia; Computer networks and communications; Business, management and accounting (miscellaneous); Hardware and architecture
no
4 Contributo in Atti di Convegno (Proceeding)::4.1 Contributo in Atti di convegno
Dessì, Nicoletta; Atzori, Maurizio
273
2
4.1 Contributo in Atti di convegno
reserved
info:eu-repo/semantics/conferencePaper
File in questo prodotto:
File Dimensione Formato  
wetice17 - is wikipedia a latent gen ontology.pdf

Solo gestori archivio

Tipologia: versione editoriale
Dimensione 460.84 kB
Formato Adobe PDF
460.84 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Questionario e social

Condividi su:
Impostazioni cookie