A crawler must crawl the Web in a scalable and efficient way because the bandwidth is neither infinite not free.
查看答案
A Web crawler should follow a combination of four policies, namely selection, revisit, politeness and parallelization.
A. 对
B. 错
A parallelization policy indicates that all Web crawlers can work together without any conflicts.
A. 对
B. 错
Large search engines can index 80% of the current size of the Web.
A. 对
B. 错
It's not difficult for a crawler to download the most relevant pages.
A. 对
B. 错