Interesting Esoterica

Topic-based vector space model

Article by Jörg Becker and Dominik Kuropka
  • Published in 2003
  • Added on
In the collection
This paper motivates and presents the Topic-based Vector Space Model (TVSM), a new vector-based approach for document comparison. The approach does not assume independence between terms and it is flexible regarding the specification of term-similarities. Stop-word-list, stemming and thesaurus can be fully integrated into the model. This paper shows further how the TVSM can be fully implemented within the context of relational databases. This facilitates the use of this approach by generic applications. At the end short comparisons with other vector-based approaches namely the Vector Space Model (VSM) and the Generalized Vector Space Model (GVSM) are presented.

Links


BibTeX entry

@article{Becker2003,
	title = {Topic-based vector space model},
	author = {J{\"{o}}rg Becker and Dominik Kuropka},
	url = {http://www.kuropka.net/files/TVSM.pdf},
	urldate = {2012-03-24},
	abstract = {This paper motivates and presents the Topic-based Vector Space Model (TVSM), a new vector-based approach for document comparison. The approach does not assume independence between terms and it is flexible regarding the specification of term-similarities. Stop-word-list, stemming and thesaurus can be fully integrated into the model. This paper shows further how the TVSM can be fully implemented within the context of relational databases. This  facilitates the use of this approach by generic applications. At the end short comparisons with other vector-based approaches namely the Vector Space Model (VSM) and the Generalized Vector Space Model (GVSM) are presented.},
	comment = {},
	year = 2003,
	collections = {Basically computer science}
}