TY - GEN
T1 - A complex bio-networks of the function profile of genes
AU - Liu, Charles C H
AU - Chiang, I-Jen
AU - Wong, Jau Min
AU - Tsai, Ginni Hsiang Chun
AU - Lin, Tsau Young
PY - 2006
Y1 - 2006
N2 - This paper presents a novel model of concept representation using a multilevel geometric structure, which is called Latent Semantic Networks. Given a set of documents, the associations among frequently co-occurring terms in any of the documents define naturally a geometric complex, which can then be decomposed into connected components at various levels. This hierarchical model of knowledge representation was validated in the functional profiling of genes. Our approach excelled the traditional approach of vector-based document clustering by the geometrical forms of frequent itemsets generated by the association rules. The biological profiling of genes were a complex of concepts, which could be decomposed into primitive concepts, based on which the relevant literature could be clustered in adequate "resolution" of contexts. The hierarchical representation could be validated with tree-based biomedical ontological frameworks, which had been applied for years, and been recently enriched by the online availability of Unified Medical Language System (UMLS) and Gene Ontology (GO). Demonstration of the model and the clustering would be performed on the relevant GeneRIF (References into Function) document set of NOD2 gene. Our geometrical model is suitable for representation of bio-logical information, where hierarchical concepts in different complexity could be explored interactively according to the context of application and the various needs of the researchers. An online clustering search engine for use on general purpose and for biomedical use, managing the search results from Google or from PubMed, are constructed based on the methodology (http://ginni.bme.ntu.edu.tw). The hierarchical presentation of clustering results and the interactive graphical display of the contents of each cluster shows the merits of our approach.
AB - This paper presents a novel model of concept representation using a multilevel geometric structure, which is called Latent Semantic Networks. Given a set of documents, the associations among frequently co-occurring terms in any of the documents define naturally a geometric complex, which can then be decomposed into connected components at various levels. This hierarchical model of knowledge representation was validated in the functional profiling of genes. Our approach excelled the traditional approach of vector-based document clustering by the geometrical forms of frequent itemsets generated by the association rules. The biological profiling of genes were a complex of concepts, which could be decomposed into primitive concepts, based on which the relevant literature could be clustered in adequate "resolution" of contexts. The hierarchical representation could be validated with tree-based biomedical ontological frameworks, which had been applied for years, and been recently enriched by the online availability of Unified Medical Language System (UMLS) and Gene Ontology (GO). Demonstration of the model and the clustering would be performed on the relevant GeneRIF (References into Function) document set of NOD2 gene. Our geometrical model is suitable for representation of bio-logical information, where hierarchical concepts in different complexity could be explored interactively according to the context of application and the various needs of the researchers. An online clustering search engine for use on general purpose and for biomedical use, managing the search results from Google or from PubMed, are constructed based on the methodology (http://ginni.bme.ntu.edu.tw). The hierarchical presentation of clustering results and the interactive graphical display of the contents of each cluster shows the merits of our approach.
UR - http://www.scopus.com/inward/record.url?scp=38549145359&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=38549145359&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:38549145359
SN - 3540360484
SN - 9783540360483
VL - 4070 LNBI
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 9
EP - 24
BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
T2 - 2005 IEEE International Conference on Granular Computing
Y2 - 25 July 2005 through 27 July 2005
ER -