clarify the rules

This commit is contained in:
Jidong Xiao
2023-10-17 16:43:36 -04:00
parent 121e6cd5f9
commit 8d60da6e08

View File

@@ -75,7 +75,7 @@ When searching *Tom Cruise*, your search engine should not include a page which
Search Engines like Google will search all types of files on the Internet, but in this assignment, we assume all files we search are HTML files. And we consider an HTML file contains the search query only if the search query can be found within the <body></body> section of the HTML file. Search Engines like Google will search all types of files on the Internet, but in this assignment, we assume all files we search are HTML files. And we consider an HTML file contains the search query only if the search query can be found within the <body></body> section of the HTML file.
Based on Rule 1 and Rule 2: the third page showed in this image should not be included in your search results, when the search query is *Tom Cruise*. Based on Rule 1 and Rule 2: when the search query is *Tom Cruise*, the third page showed in this image should not be included in your search results, unless the words *Tom Cruise* appears in the other part of the <body></body> section of this web page, which is not displayed here.
![alt text](images/tom_cruise.png "tom cruise") ![alt text](images/tom_cruise.png "tom cruise")