Designing New Crawling and Indexing Techniques for Web Search Engines - Tan Qingzhao
Authors: Tan Qingzhao
EAN: 9783639204001
Symbol
724EUL03527KS
Year of publication
2009
Elements
156
Binding
Paperback
Format
15.2 x 22.9 cm
Language
English

This thesis addresses two problems in Web search engines. The first is how a crawler with limited computing resources can effectively crawl the dynamically changing Web and acquire the most up-to-date Web documents. The second is how a search engine can provide information-object-oriented indexing methods that let users retrieve the desired information with high accuracy and efficiency. For the first problem, we design a set of sampling policies with varying download granularity, taking into account the link structure, the directory structure, and content-based features, which include a clustering technique. We further extend the clustering-based sampling approach by testing more dynamic features and strategically selecting samples from each cluster. For the second problem, we propose building indexes on metadata extracted from various information objects instead of on the whole document. We set up a digital library named ArchSeer for the domain of archeology, which allows users to retrieve archeology literature via domain-specific search engines.