Formal concept analysis and document clustering via granular computing

Tsau Young Lin, I-Jen Chiang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

A text/web document is a knowledge representation of a human idea (a structured set of thoughts). This paper refines TFIDF and Extended TFIDF(ETFIDF)[16]; These values really measures the co-occurrences of tokens. The ETFID captures the semantic more accurately. Tokens with high TFIDF values are called Keywords. The sets of (n+1) Co-occurring keywords with High ETFIDF are called n-granules. The collection of keywords and n-granules can be interpreted geometrically; they form a non-closed simplicial complex. The corresponding non-closed polyhedron is called Latent Semantic Space(LSS). LSS is a geometric knowledge base that provides the semantic to search engine:

Original languageEnglish
Title of host publicationConference Proceedings - IEEE International Conference on Systems, Man and Cybernetics
Pages4763-4767
Number of pages5
Volume6
DOIs
Publication statusPublished - 2007
Event2006 IEEE International Conference on Systems, Man and Cybernetics - Taipei, Taiwan
Duration: Oct 8 2006Oct 11 2006

Other

Other2006 IEEE International Conference on Systems, Man and Cybernetics
CountryTaiwan
CityTaipei
Period10/8/0610/11/06

Fingerprint

Granular computing
Formal concept analysis
Semantics
Knowledge representation
Search engines

Keywords

  • Granules
  • Latent semantic space
  • Simplex

ASJC Scopus subject areas

  • Engineering(all)

Cite this

Lin, T. Y., & Chiang, I-J. (2007). Formal concept analysis and document clustering via granular computing. In Conference Proceedings - IEEE International Conference on Systems, Man and Cybernetics (Vol. 6, pp. 4763-4767). [4274667] https://doi.org/10.1109/ICSMC.2006.385058

Formal concept analysis and document clustering via granular computing. / Lin, Tsau Young; Chiang, I-Jen.

Conference Proceedings - IEEE International Conference on Systems, Man and Cybernetics. Vol. 6 2007. p. 4763-4767 4274667.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Lin, TY & Chiang, I-J 2007, Formal concept analysis and document clustering via granular computing. in Conference Proceedings - IEEE International Conference on Systems, Man and Cybernetics. vol. 6, 4274667, pp. 4763-4767, 2006 IEEE International Conference on Systems, Man and Cybernetics, Taipei, Taiwan, 10/8/06. https://doi.org/10.1109/ICSMC.2006.385058
Lin TY, Chiang I-J. Formal concept analysis and document clustering via granular computing. In Conference Proceedings - IEEE International Conference on Systems, Man and Cybernetics. Vol. 6. 2007. p. 4763-4767. 4274667 https://doi.org/10.1109/ICSMC.2006.385058
Lin, Tsau Young ; Chiang, I-Jen. / Formal concept analysis and document clustering via granular computing. Conference Proceedings - IEEE International Conference on Systems, Man and Cybernetics. Vol. 6 2007. pp. 4763-4767
@inproceedings{f6627c84d1e94199a9b8d1b533253603,
title = "Formal concept analysis and document clustering via granular computing",
abstract = "A text/web document is a knowledge representation of a human idea (a structured set of thoughts). This paper refines TFIDF and Extended TFIDF(ETFIDF)[16]; These values really measures the co-occurrences of tokens. The ETFID captures the semantic more accurately. Tokens with high TFIDF values are called Keywords. The sets of (n+1) Co-occurring keywords with High ETFIDF are called n-granules. The collection of keywords and n-granules can be interpreted geometrically; they form a non-closed simplicial complex. The corresponding non-closed polyhedron is called Latent Semantic Space(LSS). LSS is a geometric knowledge base that provides the semantic to search engine:",
keywords = "Granules, Latent semantic space, Simplex",
author = "Lin, {Tsau Young} and I-Jen Chiang",
year = "2007",
doi = "10.1109/ICSMC.2006.385058",
language = "English",
isbn = "1424401003",
volume = "6",
pages = "4763--4767",
booktitle = "Conference Proceedings - IEEE International Conference on Systems, Man and Cybernetics",

}

TY - GEN

T1 - Formal concept analysis and document clustering via granular computing

AU - Lin, Tsau Young

AU - Chiang, I-Jen

PY - 2007

Y1 - 2007

N2 - A text/web document is a knowledge representation of a human idea (a structured set of thoughts). This paper refines TFIDF and Extended TFIDF(ETFIDF)[16]; These values really measures the co-occurrences of tokens. The ETFID captures the semantic more accurately. Tokens with high TFIDF values are called Keywords. The sets of (n+1) Co-occurring keywords with High ETFIDF are called n-granules. The collection of keywords and n-granules can be interpreted geometrically; they form a non-closed simplicial complex. The corresponding non-closed polyhedron is called Latent Semantic Space(LSS). LSS is a geometric knowledge base that provides the semantic to search engine:

AB - A text/web document is a knowledge representation of a human idea (a structured set of thoughts). This paper refines TFIDF and Extended TFIDF(ETFIDF)[16]; These values really measures the co-occurrences of tokens. The ETFID captures the semantic more accurately. Tokens with high TFIDF values are called Keywords. The sets of (n+1) Co-occurring keywords with High ETFIDF are called n-granules. The collection of keywords and n-granules can be interpreted geometrically; they form a non-closed simplicial complex. The corresponding non-closed polyhedron is called Latent Semantic Space(LSS). LSS is a geometric knowledge base that provides the semantic to search engine:

KW - Granules

KW - Latent semantic space

KW - Simplex

UR - http://www.scopus.com/inward/record.url?scp=34548139063&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=34548139063&partnerID=8YFLogxK

U2 - 10.1109/ICSMC.2006.385058

DO - 10.1109/ICSMC.2006.385058

M3 - Conference contribution

AN - SCOPUS:34548139063

SN - 1424401003

SN - 9781424401000

VL - 6

SP - 4763

EP - 4767

BT - Conference Proceedings - IEEE International Conference on Systems, Man and Cybernetics

ER -