Combinations of HTTP GET (URL-based) parameters can produce different URLs, all of which may be linked on the site.
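As a rough illustration of the statement above, the sketch below enumerates the URLs produced by a few GET parameters and then collapses them to one canonical form. The site `example.com`, the parameter names, and the helper functions are all hypothetical, and the sketch assumes every listed parameter only changes presentation, not content.

```python
from itertools import product
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Hypothetical gallery page with three display options (assumed names and values).
BASE = "http://example.com/gallery"
OPTIONS = {
    "sort": ["name", "date", "size", "rating"],
    "thumb": ["small", "medium", "large"],
    "format": ["jpg", "png"],
}

def all_variants(base, options):
    """Build one URL per combination of parameter values."""
    keys = list(options)
    return [
        base + "?" + urlencode(dict(zip(keys, combo)))
        for combo in product(*(options[k] for k in keys))
    ]

def canonicalize(url, ignored):
    """Drop the ignored parameters and sort the rest, so equivalent URLs compare equal."""
    parts = urlsplit(url)
    kept = sorted((k, v) for k, v in parse_qsl(parts.query) if k not in ignored)
    return urlunsplit((parts.scheme, parts.netloc, parts.path, urlencode(kept), ""))

variants = all_variants(BASE, OPTIONS)
# Assumption: all three parameters are presentation-only, so a crawler may ignore them.
canonical = {canonicalize(u, ignored=set(OPTIONS)) for u in variants}
```

Here 4 x 3 x 2 = 24 distinct URLs reduce to a single canonical URL. A real crawler cannot know in advance which parameters are safe to ignore, so this kind of normalization is a heuristic rather than a guarantee.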
A crawler must crawl the Web in a scalable and efficient way because bandwidth is neither infinite nor free.
A. True
B. False
A Web crawler should follow a combination of four policies, namely selection, revisit, politeness and parallelization.
A. True
B. False
A parallelization policy indicates that all Web crawlers can work together without any conflicts.
A. True
B. False
Large search engines can index 80% of the current size of the Web.
A. True
B. False