From 7888cadbac3e02b04cfa3e053ef0b13b6d944ca6 Mon Sep 17 00:00:00 2001
From: Jidong Xiao
Date: Thu, 14 Mar 2024 21:58:44 -0400
Subject: [PATCH] clarify the term all documents

---
 old_hws/07_search_engine/README.md | 4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/old_hws/07_search_engine/README.md b/old_hws/07_search_engine/README.md
index 65e2ed2..0e90104 100644
--- a/old_hws/07_search_engine/README.md
+++ b/old_hws/07_search_engine/README.md
@@ -1,3 +1,5 @@
+**Note:** Sample output files will be added later; submitty autograder will not open until Friday afternoon.
+
-For each keyword, the keyword's density score is a measure of how the keyword's frequency in a document compares to its occurrence in all documents, and we can use the following formula to calculate the density score of one keyword.
+For each keyword, the keyword's density score is a measure of how the keyword's frequency in a document compares to its occurrence in all documents, and we can use the following formula to calculate the density score of one keyword. (**Note:** here the term "all documents" literally means all documents, not just the documents which contain the query.)
 ```console
 Keyword Density Score = (Number of Times Keyword Appears) / (Total Content Length of this One Document * Keyword Density Across All Documents)