-
Text Pre-Processing to Latent Semantic Indexing
Slide 2: text preprocessing Stopwords removal Stemming Basic stemming methods remove ending transform words Digits Hyphens Punctuation Marks Case of Letters Identifying different text fields Identifying anchor text: Removing HTML ... -
Information Retrieval and Web Search
slide1:Introduction to Information Retrieval (IR) slide 4: Information Retrieval (IR) Architecture slide 6:IR queries types Keyword queries Boolean queries (using AND, OR, NOT) Phrase queries Proximity queries Full document queries Natural ...