move hw7 to hws
@@ -1,475 +0,0 @@
|
||||
**Note:** Sample output files will be added later; submitty autograder will not open until Friday afternoon.
|
||||
|
||||
<!-- Clarification
|
||||
|
||||
We made a clarification on the discussion forum. In case you didn't pay attention there, we are adding the clarification here.
|
||||
|
||||
2. However, these are two situations where the above rule does not apply:
|
||||
|
||||
2.1. when constructing the snippet, this above rule does not apply. When constructing the snippet, you just find the first occurrence of that word (or that query), and that really is saying that you can just call the **std::string::find**() function to find the first occurrence of that word (or that query) within the body section of the HTML file. And therefore your snippet may be like this:
|
||||
|
||||
"I am Lady Gaga."
|
||||
|
||||
when the search is a phrase search of "Lady Gaga". So this means that "." after Gaga is okay, we do not care.
|
||||
|
||||
This is also why, for test case 4.2, the following is shown in the snippet:
|
||||
|
||||
"Since 1982, The Statue of Liberty-Ellis Island Foundation has partnered with the"
|
||||
|
||||
when the search query is a phrase search of "Statue of Liberty". And this means that "-" after Liberty is okay, we do not care.
|
||||
|
||||
2.2. when counting the number of occurrences of each keyword (in the keyword density score calculation process), the above rule does not apply. When counting the occurrences of each keyword, you can just call the **std::string::find**() function to find the occurrence of that keyword. And therefore, when the keyword is *Gaga*, and the **std::string::find**() function finds *Gaga* in the sentence of "I am Lady Gaga.", that is okay, we will count this one as a valid occurrence even though there is period "." after *Gaga*.
|
||||
|
||||
So you may see that 1 and 2 are not consistent; the only reason we allow this inconsistency to exist in this assignment is to simplify your task. A fully functioning search engine will need to handle a lot of complicated cases, and that's way beyond the scope of this course.-->
|
||||
|
||||
# Homework 7 — Design and Implementation of a Simple Google
|
||||
|
||||
In this assignment you will develop a simple search engine called New York Search. Your program will mimic some of the features provided by Google. Please read the entire handout before starting to code the assignment.
|
||||
|
||||
## Learning Objectives
|
||||
|
||||
- Practice writing recursive programs.
|
||||
- Practice using std::map and std::set.
|
||||
|
||||
## Background
|
||||
|
||||
When talking about the Google search engine, what words come to mind? Page Ranking? Inverted Indexing? Web Crawler?
|
||||
|
||||
When developing a search engine, the first question to ask is where to start. When you type "Selena Gomez" or "Tom Brady" in the Google search box, where does Google start? Google does not start from one specific website; rather, it maintains a list of URLs called Seed URLs. These Seed URLs are manually chosen to represent a diverse range of high-quality, reputable websites. Search engines usually have a component called a web crawler, which crawls these URLs and then follows links from these web pages to other web pages. As the web crawler crawls these other web pages, it collects links from them to more web pages, and then follows these links to crawl more web pages. This process continues; ultimately, the goal is to discover as many web pages as possible. Once all pages are visited, the search engine builds a map, known as the inverted index, which maps terms (i.e., individual words) to web pages (also known as documents). Below is an example:
|
||||
|
||||
| Key (Term) | Value (List of Document References) |
|
||||
|-----------|---------------------------------------|
|
||||
| apple | Document1, Document3, Document5 |
|
||||
| banana | Document2, Document4 |
|
||||
| orange | Document1, Document2 |
|
||||
|
||||
**Note**: in this README, the terms web page, page, document, and HTML file all have the same meaning.
|
||||
|
||||
<!--Term Frequency
|
||||
|
||||
Metadata and Links:
|
||||
|
||||
The index may also store metadata associated with each web page, such as the page's URL, title, and description. Additionally, the index can include information about links from one page to another, which is used for link analysis and page ranking.-->
|
||||
|
||||
When a user enters a search query, the search engine consults its inverted index map to identify the documents that match the query term. These matching documents will then be ranked based on various factors, and the ranked documents will then be presented to the user. And this ranking process is the so-called Page Ranking.
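As a rough illustration of the inverted index described above, the sketch below stores it as a std::map from each term to a std::set of document URLs. The names `InvertedIndex`, `addTerm`, and `lookup` are hypothetical and chosen only for this example; your own design may differ.

```cpp
#include <map>
#include <set>
#include <string>

// Hypothetical type alias: each term maps to the set of documents containing it.
typedef std::map<std::string, std::set<std::string> > InvertedIndex;

// Record that `term` occurs in the document identified by `url`.
void addTerm(InvertedIndex& index, const std::string& term, const std::string& url) {
    index[term].insert(url);  // operator[] creates an empty set the first time a term is seen
}

// Return the documents containing `term`, or an empty set if the term was never indexed.
std::set<std::string> lookup(const InvertedIndex& index, const std::string& term) {
    InvertedIndex::const_iterator it = index.find(term);
    if (it == index.end()) {
        return std::set<std::string>();
    }
    return it->second;
}
```

Using a std::set as the value means a document is listed only once per term, no matter how many times the term appears in it.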
|
||||
|
||||
## Implementation
|
||||
|
||||
Based on the above description, you can see there are 3 steps when implementing a search engine:
|
||||
|
||||
1. web crawling
|
||||
2. query searching
|
||||
3. page ranking
|
||||
|
||||
And thus, in this assignment, you are recommended to (but not required to) write your search engine following this same order of 3 steps. More details about each of these 3 steps are described below:
|
||||
|
||||
### Web Crawling
|
||||
|
||||
The Web Crawler's goal is to build the inverted index.
|
||||
|
||||
### Query Searching
|
||||
|
||||
The Query Searching component's goal is to identify the matching documents.
|
||||
|
||||
### Page Ranking
|
||||
|
||||
Once the matching documents are identified, you should rank these documents and present them to the user. Google uses a variety of factors in its page ranking, but in this assignment, your page ranking is required to consider the following factors:
|
||||
|
||||
- Keywords Density. <!--(keyword stuffing)-->
|
||||
- Backlinks. <!--: The number and quality of links from other reputable websites are assessed.-->
|
||||
<!--- Freshness.-->
|
||||
|
||||
For each page to be presented, we calculate a page score, and then present these pages in a descending order to the user, i.e., pages whose page score is higher should be presented first. As the page score consists of two factors, we will calculate the score for each of these two factors, and we name them the *keywords density score*, and the *backlinks score*, respectively. Once we have these two scores, we can get the page score using this formula:
|
||||
|
||||
page score = (0.5 * keywords density score + 0.5 * backlinks score); [**formula 1**] <a name="formula-1"></a>
|
||||
|
||||
In order to match the results used by the autograder, you should define all scores as *double*. Next we will describe how to calculate the keywords density score and the backlinks score.
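Before going into the two factor scores, here is a minimal sketch of formula 1 and of ordering pages by descending page score. The `ScoredPage` pair and the function names are assumptions made for this example, not part of the assignment.

```cpp
#include <algorithm>
#include <string>
#include <utility>
#include <vector>

// Hypothetical representation of a result: (page score, URL).
typedef std::pair<double, std::string> ScoredPage;

// Formula 1: combine the two factor scores with equal weights.
double pageScore(double keywordsDensityScore, double backlinksScore) {
    return 0.5 * keywordsDensityScore + 0.5 * backlinksScore;
}

// Comparison used for sorting: pages with a higher score come first.
bool higherScoreFirst(const ScoredPage& a, const ScoredPage& b) {
    return a.first > b.first;
}

// Sort the pages in descending order of page score.
void rankPages(std::vector<ScoredPage>& pages) {
    std::sort(pages.begin(), pages.end(), higherScoreFirst);
}
```

Note that std::sort does not promise any particular order for pages with equal scores; this sketch makes no attempt to break ties.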
|
||||
|
||||
#### Keywords Density Score
|
||||
|
||||
A search query may contain one keyword or multiple keywords. Given a set of keywords, we can calculate the keywords density score by following these two steps:
|
||||
|
||||
1. Calculate a density score for each keyword within the document.
|
||||
2. Accumulate these individual density scores into a combined score. <!--represent the overall keyword density of the document for the given set of keywords.-->
|
||||
|
||||
For each keyword, the keyword's density score is a measure of how the keyword's frequency in a document compares to its occurrence in all documents, and we can use the following formula to calculate the density score of one keyword. (**Note:** here the term "all documents" literally means all documents, not just the documents which contain the query.)
|
||||
|
||||
```console
|
||||
Keyword Density Score = (Number of Times Keyword Appears) / (Total Content Length of this One Document * Keyword Density Across All Documents)
|
||||
```
|
||||
|
||||
Here, we consider the content of each document as a string. Also, here "Total Content Length" means the total length of the whole document, not just the length of the <body> section; and the "Number of Times Keyword Appears" means the number of times the keyword appears in the whole document, not just in the <body> section. Similarly, when calculating the "Keyword Density Across All Documents", you should also consider the whole document, not just the <body> section.
|
||||
|
||||
Let's explain this formula with an example: let's say we have 4 documents in total, and the user wants to search *Tom Cruise*. Assume the first document has 50 characters (i.e., the document length of the first document is 50), the second document has 40 characters, the third document has 100 characters, and the fourth document has 200 characters. The keyword *Tom* appears in the first document 2 times, appears in the second document 3 times, appears in the third document 4 times, and appears in the fourth document 0 times. Then for this keyword *Tom*, the density across all documents would be:
|
||||
|
||||
```console
|
||||
(2 + 3 + 4 + 0) / (50 + 40 + 100 + 200) = 0.023
|
||||
```
|
||||
|
||||
and the keyword density score for this keyword *Tom* in the first document, would be:
|
||||
|
||||
```console
|
||||
2 / (50 * 0.023) = 1.739
|
||||
|
||||
```
|
||||
|
||||
and the keyword density score for this keyword *Tom* in the second document, would be:
|
||||
|
||||
```console
|
||||
3 / (40 * 0.023) = 3.261
|
||||
|
||||
```
|
||||
|
||||
and the keyword density score for this keyword *Tom* in the third document, would be:
|
||||
|
||||
```console
|
||||
4 / (100 * 0.023) = 1.739
|
||||
|
||||
```
|
||||
|
||||
Once we get the density score for the keyword *Tom* in the first document (let's denote this score by denScore1), and we get the density score for the keyword *Cruise* in the first document (let's denote this score by denScore2), then the keywords density score for the search query *Tom Cruise* in the first document would be *(denScore1 + denScore2)*.
|
||||
|
||||
#### Backlinks Score
|
||||
|
||||
There are typically two types of links on the Internet.
|
||||
|
||||
1. **Outgoing Links**: These are links from a particular webpage on your website to other webpages or websites. Outgoing links are also known as "outbound links". They provide navigation from your webpage to other relevant resources on the internet.
|
||||
|
||||
2. **Incoming Backlinks**: These are links from other websites or webpages that direct users to a specific webpage on your website. Incoming backlinks are also commonly referred to as "inbound links" or simply "backlinks". Search engines like Google consider incoming backlinks as an important factor when determining the authority, relevance, and popularity of a webpage. Pages with a higher number of quality backlinks are often perceived as more authoritative and are likely to rank higher in search engine results pages.
|
||||
|
||||
A backlinks score for a webpage is based on the importance of its incoming backlinks, where linking pages with fewer outgoing links are considered more valuable and contribute more to the score. Let's say there are N web pages which have links pointing to this current page. We name these pages doc_1, doc_2, ..., doc_N, and we use doc_i->outgoingLinks to denote how many outgoing links document i has. Then we can calculate the backlinks score of this current page as follows:
|
||||
|
||||
|
||||
```console
|
||||
backlinks score = ( 1.0 / (1 + doc_1->outgoingLinks) + 1.0 / (1 + doc_2->outgoingLinks) + ... + 1.0 / (1 + doc_N->outgoingLinks) );
|
||||
```
|
||||
|
||||
Once you have both the keywords density score and the backlinks score, you can then use [formula 1](#formula-1) to get the overall score for a page.
|
||||
|
||||
## Assignment Scope
|
||||
|
||||
To reduce the scope of the assignment, and hence the amount of work for you, we make the following rules for this search engine.
|
||||
|
||||
### Rule 1. Search HTML Files Only
|
||||
|
||||
Search Engines like Google will search all types of files on the Internet, but in this assignment, we assume all files we search are HTML files.
|
||||
|
||||
### Rule 2. Local Searching Only
|
||||
|
||||
The search engine you implement will not search anything on the Internet, as that requires extensive knowledge in computer networks and will need to include network libraries, which is way beyond the scope of this course. In this assignment, we limit our searches to a local folder, which is provided as [html_files](html_files).
|
||||
|
||||
You are also not allowed to use file system libraries such as <filesystem> to access the HTML files; instead, you should follow the instructions given in the [other useful code](#other-useful-code) section to open HTML files and follow links within each HTML file to get to other HTML files.
|
||||
|
||||
## Supported Commands
|
||||
|
||||
Your program will be run like this:
|
||||
|
||||
```console
|
||||
nysearch.exe html_files/index.html input.txt
|
||||
```
|
||||
|
||||
Here:
|
||||
|
||||
- *nysearch.exe* is the executable file name.
|
||||
- html_files/index.html is the Seed URL. While Google maintains a list of Seed URLs, in this assignment, we will just use one single HTML file as the Seed page, and the path of this file is the Seed URL.
|
||||
- input.txt is the input file which contains search queries. Each line of this file is a search query.
|
||||
|
||||
Your program should treat each line in the input file as a search query, and print the search results corresponding to each search query into a separate file.
|
||||
Name your output file(s) this way: out1.txt, out2.txt, out3.txt, out4.txt, ...
|
||||
|
||||
Here
|
||||
1. out1.txt contains the search results for the first search query - i.e., the query appears in line 1 of the input file.
|
||||
2. out2.txt contains the search results for the second search query - i.e., the query appears in line 2 of the input file.
|
||||
3. out3.txt contains the search results for the third search query - i.e., the query appears in line 3 of the input file.
|
||||
4. out4.txt contains the search results for the fourth search query - i.e., the query appears in line 4 of the input file.
|
||||
...
|
||||
|
||||
You must name your output files in such a way. You will fail the test cases if your output files are not named as "out1.txt", "out2.txt", "out3.txt", "out4.txt", etc. And yes, if the input file has 1000 lines, then your program will produce 1000 output files.
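One possible way to build these file names is shown below; `outputFileName` is a hypothetical helper, and the sketch assumes a compiler that supports std::to_string (C++11 or later).

```cpp
#include <string>

// Build the output file name for the query on line `queryNumber` of the input file.
// Line numbers start at 1, so the first query goes to "out1.txt".
std::string outputFileName(int queryNumber) {
    return "out" + std::to_string(queryNumber) + ".txt";
}
```

A typical use would be to keep a counter while reading the input file with std::getline, and to open a std::ofstream on `outputFileName(counter)` for each query.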
|
||||
|
||||
### Phrase Search vs Regular Search
|
||||
|
||||
Your search engine should support both phrase search and regular search.
|
||||
1. When searching multiple words with double quotes, it is called a phrase search. In phrase search, the whole phrase must exist somewhere in the searched document. In other words, the search engine will search for the exact phrase, word for word, and in the specified order.
|
||||
2. When searching multiple words without double quotes, it is called a regular search. In this assignment, we define the term *regular search* as such: the search engine should look for documents which contain every word of the search query, but these words do not need to appear together, and they can appear in any order within the document.
|
||||
|
||||
Based on the above definition, a document which only contains the following two lines is a valid document when the user performs a regular search looking for *Tom Cruise*:
|
||||
|
||||
```console
|
||||
Tom and Jerry show
|
||||
Have Fun And Save Now With Great Deals When You Cruise With Carnival. Book Online Today.
|
||||
```
|
||||
|
||||
Because we can find both the word *Tom* and the word *Cruise*. But it is not a valid document if the user does a phrase search - *"Tom Cruise"*, as no exact match can be found in this document.
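To tell the two kinds of search apart, a program might inspect the query line as in the sketch below. It assumes that a phrase query is a line whose first and last characters are double quotes, with no trailing whitespace; the helper names are hypothetical.

```cpp
#include <sstream>
#include <string>
#include <vector>

// A query is treated as a phrase search if it is enclosed in double quotes.
bool isPhraseSearch(const std::string& query) {
    return query.size() >= 2 && query[0] == '"' && query[query.size() - 1] == '"';
}

// Remove the enclosing double quotes of a phrase query; other queries are returned unchanged.
std::string stripQuotes(const std::string& query) {
    if (isPhraseSearch(query)) {
        return query.substr(1, query.size() - 2);
    }
    return query;
}

// Split a query into its whitespace-separated keywords.
std::vector<std::string> splitKeywords(const std::string& query) {
    std::vector<std::string> keywords;
    std::istringstream stream(query);
    std::string word;
    while (stream >> word) {
        keywords.push_back(word);
    }
    return keywords;
}
```

For a regular search, every keyword returned by `splitKeywords` must be found in a document for it to match; for a phrase search, the string returned by `stripQuotes` must be found as a whole.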
|
||||
|
||||
### Definition of Match
|
||||
|
||||
When searching a document, you should follow these rules:
|
||||
|
||||
### Rule 1. Case-sensitive Search Engine
|
||||
|
||||
Search engines are usually case-insensitive, but making the search engine case-insensitive will require some extra work and will likely need some functions we have not learned in this course. Therefore, to simplify your task and reduce the amount of your work, the search engine you are going to implement in this assignment is case-sensitive. In other words, when searching *Tom*, the word *Tom* is a match, but neither the word *TOM* nor the word *tom* is a match.
|
||||
|
||||
### Rule 2. Word Boundary
|
||||
|
||||
When searching the word *Tom*, we do not consider the substring *Tom* in *Tomato* as a match, and we do not consider the substring *Tom* in *4Tom* or *Tom32* as a match; but we do consider the substring *Tom* in *Tom.*, *Tom-*, *.Tom*, *-Tom*, *_Tom*, *Tom!*, " Tom", " Tom ", etc., as a match. In other words, the word *Tom* is found in a document only if it appears as a standalone word, meaning that the character right before *Tom* and the character right after *Tom* must be a word boundary. And in this assignment, you can consider any non-alphanumeric character as a word boundary. This behavior is consistent with what Google does.
|
||||
|
||||
This rule also applies to phrase search. We consider a phrase to be a match only if we find the phrase and both the character right before the phrase and the character right after the phrase are word boundaries, i.e., non-alphanumeric characters.
|
||||
|
||||
To determine if a character is an alphanumeric character or not, you can call std::isalnum(). This function considers the following characters as alphanumeric:
|
||||
|
||||
```console
|
||||
digits (0123456789)
|
||||
uppercase letters (ABCDEFGHIJKLMNOPQRSTUVWXYZ)
|
||||
lowercase letters (abcdefghijklmnopqrstuvwxyz)
|
||||
```
|
||||
|
||||
The function takes a single character as its sole argument. It returns a non-zero value if the character is alphanumeric, and 0 otherwise.
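The word boundary rule can be sketched with std::string::find and std::isalnum as shown below. The names `isBoundary` and `findWholeWord` are hypothetical, and treating the very beginning and the very end of the text as word boundaries is an assumption of this sketch.

```cpp
#include <cctype>
#include <cstddef>
#include <string>

// True if `pos` is past the end of `text` or holds a non-alphanumeric character.
bool isBoundary(const std::string& text, std::size_t pos) {
    return pos >= text.size() || !std::isalnum(static_cast<unsigned char>(text[pos]));
}

// Return the position of the first standalone occurrence of `query` in `text`,
// or std::string::npos if there is none. Works for single words and for phrases.
std::size_t findWholeWord(const std::string& text, const std::string& query) {
    if (query.empty()) return std::string::npos;
    std::size_t pos = text.find(query);
    while (pos != std::string::npos) {
        bool leftOk = (pos == 0) || isBoundary(text, pos - 1);
        bool rightOk = isBoundary(text, pos + query.size());
        if (leftOk && rightOk) {
            return pos;
        }
        pos = text.find(query, pos + 1);  // keep looking after a rejected occurrence
    }
    return std::string::npos;
}
```

The cast to unsigned char is there because passing a negative char value to std::isalnum is undefined behavior. Since std::string::find is case-sensitive, this sketch is also consistent with Rule 1.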
|
||||
|
||||
## Input Files
|
||||
|
||||
Your program takes two types of input files: the HTML files and the input.txt file, which contains all the search query terms.
|
||||
|
||||
All the HTML files are provided under the [html_files](html_files) directory. Only one of these HTML files is provided via the command line; this file is the Seed file, and its path (i.e., html_files/index.html) is used as the Seed URL. Your web crawler should search this HTML file for links, follow these links to crawl other HTML files, and repeat this process until it cannot reach any more files. Keep in mind that links which take you to an HTML file you have already crawled should be skipped; otherwise, you will get into an infinite loop.
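The recursive structure of such a crawler, including the check that prevents infinite loops, can be sketched as follows. To keep the example self-contained, the files are replaced by a hypothetical in-memory `LinkGraph` that maps each page to the pages it links to; in your program you would instead read each file and extract its links.

```cpp
#include <list>
#include <map>
#include <set>
#include <string>

// Hypothetical stand-in for the HTML files: page path -> paths of the pages it links to.
typedef std::map<std::string, std::list<std::string> > LinkGraph;

// Recursively visit `url` and every page reachable from it, recording each page once.
void crawl(const std::string& url, const LinkGraph& graph, std::set<std::string>& visited) {
    // insert() reports whether the URL was newly added; if not, this page was already crawled.
    if (!visited.insert(url).second) {
        return;
    }
    LinkGraph::const_iterator page = graph.find(url);
    if (page == graph.end()) {
        return;  // no links recorded for this page
    }
    for (std::list<std::string>::const_iterator link = page->second.begin();
         link != page->second.end(); ++link) {
        crawl(*link, graph, visited);
    }
}
```

The std::set of visited URLs is what stops the recursion when two pages link to each other.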
|
||||
|
||||
## Output File Format
|
||||
|
||||
The output of your program should go to the output file.
|
||||
|
||||
- If no matches can be found for a search query, your search engine should print the following message to the output file.
|
||||
|
||||
```console
|
||||
Your search - dsdwoddjojdjeokdddfjwoewojo - did not match any documents.
|
||||
```
|
||||
|
||||
Replace *dsdwoddjojdjeokdddfjwoewojo* with the search query.
|
||||
|
||||
This behavior matches what Google does.
|
||||
|
||||

|
||||
|
||||
- If matches are found, you should print the ranked results in a format similar to what Google does, as shown in the following image:
|
||||
|
||||

|
||||
|
||||
More specifically, for each document, print
|
||||
1. the title
|
||||
2. the url
|
||||
3. the description
|
||||
4. a snippet
|
||||
|
||||
### The Title
|
||||
|
||||
In all HTML files we provide, in the <head> section of the HTML, we have a "title" element. It is used to define the title of the web page or document. In the following example, the text "ESPN" within the <title> tags represents the title of the web page, which is typically displayed in the browser's title bar or tab, and it is often used by search engines to display the title of the page in search results.
|
||||
|
||||
```html
|
||||
<title>ESPN</title>
|
||||
```
|
||||
|
||||
### The URL
|
||||
|
||||
This portion will be different from what Google shows: as our search is limited to the [html_files](html_files) folder, the URL we present will just be a path within this folder. Here are some example URLs we use in this assignment:
|
||||
|
||||
```plaintext
|
||||
html_files/file1.html
|
||||
html_files/subdir1/subdir2/file7.html
|
||||
html_files/subdir1/subdir2/subdir3/subdir4/file13.html
|
||||
```
|
||||
|
||||
### The Description
|
||||
|
||||
In all HTML files we provide, in the <head> section of the HTML, we have a meta description tag which provides a brief description of the page's content. This description is often displayed by search engines in search results to give users an idea of what the web page is about. The following is an example:
|
||||
|
||||
```html
|
||||
<meta name="description" content="Boston Celtics Scores, Stats and Highlights">
|
||||
```
|
||||
|
||||
Here, "Boston Celtics Scores, Stats and Highlights" is the description.
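One way to pull the title and the description out of a document's content is sketched below, using only std::string::find and std::string::substr. It assumes the tags are written exactly as in the two examples above (same attribute order, spacing, and double quotes); the function names are hypothetical.

```cpp
#include <cstddef>
#include <string>

// Return the text between the first occurrence of `open` and the next occurrence
// of `close` after it, or an empty string if either marker is missing.
std::string extractBetween(const std::string& content, const std::string& open,
                           const std::string& close) {
    std::size_t start = content.find(open);
    if (start == std::string::npos) return "";
    start += open.size();
    std::size_t end = content.find(close, start);
    if (end == std::string::npos) return "";
    return content.substr(start, end - start);
}

// Text inside the <title> element.
std::string extractTitle(const std::string& content) {
    return extractBetween(content, "<title>", "</title>");
}

// Value of the content attribute of the meta description tag.
std::string extractDescription(const std::string& content) {
    return extractBetween(content, "<meta name=\"description\" content=\"", "\">");
}
```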
|
||||
|
||||
### The Snippet
|
||||
|
||||
This snippet contains an excerpt from the page's content that is directly related to the search query. In this assignment, the requirements for this snippet are:
|
||||
|
||||
1. when constructing the snippet, you should only consider the <body> section of the HTML files. In other words, the snippet must come from the <body> section only.
|
||||
|
||||
2. The snippet should contain exactly 120 characters.
|
||||
|
||||
3.1 For a phrase search, the snippet should start from the beginning of a sentence which contains the query. This means the query itself may not appear in the snippet: this is possible when a sentence contains the query, but the query does not appear in the first 120 characters of the sentence. If the query appears multiple times in a document, consider the first occurrence only. In other words, to construct the snippet, your program should search for the first occurrence of the query in the <body> section of the document.
|
||||
|
||||
3.2 For a regular search, if an exact match can be found in the <body> section of the document, the snippet should start from the beginning of a sentence which contains the query, and if the query appears multiple times in the <body> section of the document, consider the first occurrence only; if an exact match can not be found in the <body> section of the document, the snippet should start from the beginning of a sentence which contains the first keyword of the query, and if the first keyword appears multiple times in the <body> section of the document, consider the first occurrence only.
|
||||
|
||||
**Note**, to simplify the construction of the snippets, we have tailored the provided HTML files such that you can identify the beginning of a sentence via searching the period sign before the sentence. In this assignment, you can assume that there is always a period sign before the sentence which contains the snippet you are going to construct, however, it is possible that there are some whitespaces in between the period and the start of the sentence.
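Putting these requirements together, snippet construction might look like the sketch below. `buildSnippet` is a hypothetical helper; it assumes `queryPos` is the position of the first occurrence of the query (or first keyword) inside the <body> section, and it falls back to the start of the text if no period is found, which goes beyond what the handout guarantees.

```cpp
#include <cctype>
#include <cstddef>
#include <string>

// Build a snippet of up to 120 characters that starts at the beginning of the
// sentence containing position `queryPos` in `data`.
std::string buildSnippet(const std::string& data, std::size_t queryPos) {
    // Find the period that ends the previous sentence.
    std::size_t start = data.rfind(".", queryPos);
    if (start == std::string::npos) {
        start = 0;   // assumption: no period found, so start from the beginning
    } else {
        start += 1;  // move past the period itself
    }
    // Skip any whitespace between the period and the first character of the sentence.
    while (start < data.size() && std::isspace(static_cast<unsigned char>(data[start]))) {
        ++start;
    }
    return data.substr(start, 120);
}
```

Note that std::string::substr returns fewer than 120 characters when the text ends early, so this sketch only yields exactly 120 characters when enough content follows the start of the sentence.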
|
||||
|
||||
## Useful String Functions
|
||||
|
||||
You may find the following functions to be useful (most of them are string functions, except *std::isspace*):
|
||||
|
||||
- rfind: this function does reverse find in a string. When finding the start position of a sentence which contains a keyword or a query, the string function *rfind*() can be useful, as this function can be used to search a string for the last occurrence of the period sign. For example, if you find that the query starts at position *queryPos*, then you can use the *rfind*() function like this to locate the period sign before the sentence which contains this query:
|
||||
|
||||
```cpp
|
||||
size_t periodPos = data.rfind(".", queryPos);
|
||||
```
|
||||
|
||||
Here *data* is a string which contains the full content of the document. Once you locate the period sign, you can then skip any possible whitespaces to get to the start of the sentence. And in order to skip whitespaces, you may want to use this next function - *std::isspace*().
|
||||
|
||||
- std::isspace: we use this function to check if a given character is a whitespace character.
|
||||
- find: we use this function to search a string for the first occurrence of some character or some substring.
|
||||
- substr: we use this function to get a substring of an existing string.
|
||||
- find_last_of: in this assignment, there might be several situations when you need to find the last slash of a URL. And for that purpose, you can use the *find_last_of*() function. An example usage case is, given the URL "html_files/subdir1/subdir2/file7.html" as a string, if you want to get the directory "html_files/subdir1/subdir2/", you can use *find_last_of*() and *substr*() like this.
|
||||
```cpp
|
||||
std::string directory;
|
||||
// suppose URL is "html_files/subdir1/subdir2/file7.html"
|
||||
size_t lastSlashPos = URL.find_last_of('/');
|
||||
if (lastSlashPos != std::string::npos) {
|
||||
// directory will now be "html_files/subdir1/subdir2/"
|
||||
directory = URL.substr(0, lastSlashPos + 1);
|
||||
}
|
||||
```
|
||||
|
||||
<!-- erase: when doing a phrase search, we enclose our query with double quotes. Unfortunately, the autograder is not smart enough to handle this, and it will pass the double quotes as a part of the query string. And therefore, in your program, you need to remove the double quotes, and you can do so using code like this:
|
||||
|
||||
```cpp
|
||||
size_t quotePos;
|
||||
// unfortunately, autograder will pass \" to the command line, and thus the double quote will be a part of the string.
|
||||
if( (quotePos = tmpString.find('"')) != std::string::npos ){
|
||||
tmpString.erase(quotePos, 1); // remove the double quote character at the found position; here number 1 as the second argument means erasing 1 character.
|
||||
}
|
||||
```
|
||||
|
||||
Here *tmpString* is a string which might contain one double quote character, for example, *tmpString* might be **"Tom**, or it might be **Cruise"**.
|
||||
-->
|
||||
|
||||
## Provided Functions
|
||||
|
||||
Parsing an HTML file and extracting all the links from it may require some regular expression library functions. Using these functions is beyond the scope of this course, so the following function (which calls regular expression library functions) is provided for you. This function takes a std::string argument representing the content of an HTML file, extracts all links in this HTML file, and returns them as a linked list, represented by a std::list<std::string> object.
|
||||
|
||||
```cpp
|
||||
// function to parse an HTML file and extract links to local files
|
||||
std::list<std::string> extractLinksFromHTML(const std::string& fileContent) {
|
||||
std::list<std::string> links;
|
||||
// regular expression to match href attributes in anchor tags
|
||||
std::regex linkRegex("<a\\s+[^>]*href\\s*=\\s*['\"]([^'\"]+)['\"][^>]*>");
|
||||
std::smatch match;
|
||||
|
||||
// search for links in the HTML content
|
||||
std::string::const_iterator start = fileContent.cbegin();
|
||||
while (std::regex_search(start, fileContent.cend(), match, linkRegex)) {
|
||||
if (match.size() > 1) {
|
||||
links.push_back(match[1].str());
|
||||
}
|
||||
start = match.suffix().first;
|
||||
}
|
||||
|
||||
return links;
|
||||
}
|
||||
```
|
||||
|
||||
In order to use this function, you need to include the regex library like this:
|
||||
|
||||
```cpp
|
||||
#include <regex>
|
||||
```
|
||||
|
||||
## Other Useful Code
|
||||
|
||||
Unlike previous assignments, where you read input files and parse them line by line, in this assignment, when you open an HTML file, you may want to store the full content of this file in a string. For example, if you want to open the file file3.html, whose path is "html_files/subdir1/file3.html", and store the full content of this file in a string, you can do this:
|
||||
|
||||
```cpp
|
||||
std::ifstream fileStream(filePath);
|
||||
if (fileStream.is_open()) {
|
||||
std::string fileContent((std::istreambuf_iterator<char>(fileStream)), std::istreambuf_iterator<char>());
|
||||
// suppose filePath is the string "html_files/subdir1/file3.html", then at this point, the string fileContent will be the full content of this file file3.html.
|
||||
// do something with fileContent
|
||||
}
|
||||
```
|
||||
|
||||
Make sure you still include the fstream library.
|
||||
|
||||
```cpp
|
||||
#include <fstream>
|
||||
```
|
||||
|
||||
## Program Requirements & Submission Details
|
||||
|
||||
In this assignment, you are required to use either std::map or std::set. You can use both if you want to. You are NOT allowed to use any data structures we have not learned so far, but feel free to use any data structures we have already learned, such as std::string, std::vector, std::list. In addition, **the web crawler component of your program must be recursive**.
|
||||
|
||||
Use good coding style when you design and implement your program. Organize your program into functions: don’t put all the code in main! Be sure to read the [Homework Policies](https://www.cs.rpi.edu/academics/courses/spring24/csci1200/homework_policies.php) as you put the finishing touches on your solution. Be sure to make up new test cases to fully debug your program and don’t forget to comment your code! Use the provided template [README.txt](./README.txt) file for notes you want the grader to read.
|
||||
You must do this assignment on your own, as described in the [Collaboration Policy & Academic Integrity](https://www.cs.rpi.edu/academics/courses/spring24/csci1200/academic_integrity.php) page. If you did discuss the problem or error messages, etc. with anyone, please list their names in your README.txt file.
|
||||
|
||||
**Due Date**: 03/21/2024, Thursday, 10pm.
|
||||
|
||||
<!--## Instructor's Code
|
||||
|
||||
You can test (but not view) the instructor's code here: [instructor code](http://cs.rpi.edu/~xiaoj8/ds/search/). Note that this page just uses a copy of Google's homepage to serve as the front end, and at the back end it runs the instructor's C++ code. This page does not support the "enter" key, you need to press the "New York Search" button to submit a query.-->
|
||||
|
||||
## Rubric
|
||||
|
||||
21 pts

- README.txt Completed (3 pts)
  - One of name, collaborators, or hours not filled in. (-1)
  - Two or more of name, collaborators, or hours not filled in. (-2)
  - No reflection. (-1)
- IMPLEMENTATION AND CODING STYLE (Good class design, split into a .h and .cpp file. Functions > 1 line are in .cpp file. Organized class implementation and reasonable comments throughout. Correct use of const/const& and of class method const.) (8 pts)
  - No credit (significantly incomplete implementation) (-8)
  - Putting almost everything in the main function. It's better to create separate functions for different tasks. (-2)
  - Function bodies containing more than one statement are placed in the .h file. (okay for templated classes) (-2)
  - Missing include guards in the .h file. (Or does not declare them correctly) (-1)
  - Functions are not well documented or are poorly commented, in either the .h or the .cpp file. (-1)
  - Improper uses or omissions of const and reference. (-1)
  - At least one function is excessively long (i.e., more than 200 lines). (-1)
  - Overly cramped, excessive whitespace, or poor indentation. (-1)
  - Poor file organization: Puts more than one class in a file (okay for very small helper classes) (-1)
  - Poor choice of variable names: non-descriptive names (e.g. 'vec', 'str', 'var'), single-letter variable names (except single loop counter), etc. (-2)
  - Contains useless comments like commented-out code, terminal commands, or silly notes. (-1)
- DATA REPRESENTATION (7 pts)
  - Uses data structures which have not been covered in this class. (-7)
  - Uses filesystem library (i.e., #include <filesystem> ). (-7)
  - Neither std::map nor std::set is used. (-7)
  - Member variables are public. (-2)
- RECURSION (3 pts)
  - Does not use recursion in the web crawler component. (-3)
|
||||
|
||||
## Appendix A - HTML File Basics
|
||||
|
||||
A typical HTML file consists of two main sections: the `<head>` section and the `<body>` section.

1. The `<head>` section contains metadata about the document, such as its title, character encoding, stylesheets, scripts, and other information that is not directly displayed on the web page.

2. The `<body>` section contains the actual content of the document that is displayed to the user, such as text, images, links, and other elements.

These two sections together define the structure and content of an HTML document. The following example is a basic HTML file.
|
||||
|
||||
```html
1. <!DOCTYPE html>
2. <html lang="en">
3. <head>
4. <meta charset="UTF-8">
5. <meta name="description" content="Example HTML file with head and body sections">
6. <meta name="keywords" content="HTML, example, head, body">
7. <meta name="author" content="Your Name">
8. <title>Example HTML File</title>
9. </head>
10. <body>
11. <h1>Welcome to My Website</h1>
12. <p>This is the body content of the HTML file. You can add any content you like here.</p>
13. <ul>
14. <li><a href="https://example.com">Example Website</a></li>
15. <li><a href="https://www.w3schools.com/html/">HTML Tutorial</a></li>
16. <li><a href="https://developer.mozilla.org/en-US/docs/Web/HTML">MDN Web Docs: HTML</a></li>
17. </ul>
18. </body>
19. </html>
```
|
||||
|
||||
Here:

- lines 3 and 9 mark the head section of this HTML file.

- lines 10 and 18 mark the body section of this HTML file.

- line 5 is the description tag.

- line 8 is the title tag.

- lines 14, 15, and 16 are outgoing links.
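
The pieces listed above (title, description, and outgoing links) can be pulled out of an HTML file with plain `std::string::find` calls. The following is a minimal sketch, assuming the whole file has already been read into a `std::string` and that the tags are written exactly as in the example (double-quoted attributes, `name` before `content`). The helper names `extractBetween` and `extractLinks` are hypothetical and are not part of any provided starter code.

```cpp
#include <string>
#include <vector>

// Hypothetical helper: returns the text between the first occurrence of
// `open` and the next occurrence of `close` after it.
// Returns an empty string if either marker is missing.
std::string extractBetween(const std::string& html,
                           const std::string& open,
                           const std::string& close) {
    std::string::size_type start = html.find(open);
    if (start == std::string::npos) {
        return "";
    }
    start += open.size();
    std::string::size_type end = html.find(close, start);
    if (end == std::string::npos) {
        return "";
    }
    return html.substr(start, end - start);
}

// Hypothetical helper: collects the value of every href="..." attribute,
// in the order the links appear in the file.
std::vector<std::string> extractLinks(const std::string& html) {
    std::vector<std::string> links;
    const std::string marker = "href=\"";
    std::string::size_type pos = html.find(marker);
    while (pos != std::string::npos) {
        pos += marker.size();                       // first character of the URL
        std::string::size_type end = html.find('"', pos);
        if (end == std::string::npos) {
            break;                                  // unterminated attribute
        }
        links.push_back(html.substr(pos, end - pos));
        pos = html.find(marker, end);               // continue after this link
    }
    return links;
}
```

With these helpers, `extractBetween(html, "<title>", "</title>")` would return the title, and `extractBetween(html, "<meta name=\"description\" content=\"", "\"")` would return the description. This is string matching rather than real HTML parsing, so it only works when the input follows the simple layout shown in the example.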
|
||||
@@ -1,37 +0,0 @@
|
||||
HOMEWORK 7: Search Engine
|
||||
|
||||
|
||||
NAME: < insert name >
|
||||
|
||||
|
||||
COLLABORATORS AND OTHER RESOURCES:
|
||||
List the names of everyone you talked to about this assignment
|
||||
(classmates, TAs, ALAC tutors, upperclassmen, students/instructor via
|
||||
LMS, etc.), and all of the resources (books, online reference
|
||||
material, etc.) you consulted in completing this assignment.
|
||||
|
||||
< insert collaborators / resources >
|
||||
|
||||
Remember: Your implementation for this assignment must be done on your
own, as described in the "Academic Integrity for Homework" handout.
|
||||
|
||||
|
||||
ESTIMATE OF # OF HOURS SPENT ON THIS ASSIGNMENT: < insert # hours >
|
||||
|
||||
|
||||
MISC. COMMENTS TO GRADER:
|
||||
(optional, please be concise!)
|
||||
|
||||
|
||||
## Reflection and Self Assessment
|
||||
|
||||
Discuss the issues you encountered during development and testing. What
|
||||
problems did you have? What did you have to research and learn on your
|
||||
own? What kinds of errors did you get? How did you fix them?
|
||||
|
||||
What parts of the assignment did you find challenging? Is there anything that
|
||||
finally "clicked" for you in the process of working on this assignment? How well
|
||||
did the development and testing process go for you?
|
||||
|
||||
< insert reflection >
|
||||
|
||||
@@ -1,23 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>Bleacher Report</title>
|
||||
<meta name="description" content="Tom Brady - Tampa Bay Buccaneers Quarterback">
|
||||
</head>
|
||||
<body>
|
||||
<h1>html_files/file1.html</h1>
|
||||
<p><a href="file2.html">file2.html</a></p>
|
||||
<p><a href="subdir1/file3.html">file3.html</a></p>
|
||||
<p><a href="subdir1/subdir2/file4.html">file4.html</a></p>
|
||||
New England Patriots fans appear fed up with Bill Belichick. Frankly, it's surprising. For the better part of the past 20 years, legions of Patriots fans lived by the mantra "In Bill We Trust." With Belichick given carte blanche to run basically everything on the football side of the business, the team enjoyed an unprecedented run of success that included six Super Bowl wins.
|
||||
The New York Jets did not make a call to Tom Brady after Aaron Rodgers was lost for the season to a torn Achilles, sources say, and the all-time great has made it very clear that he's retired for real this time.
|
||||
|
||||
However, the door will officially remain open for Brady to play for any team -- at least until October and quite possibly longer.
|
||||
|
||||
Brady's purchase of a minority stake in the Raiders has not been finalized, sources say. A vote could come as soon as next month's Fall League Meeting, sources say, but there is still plenty to sort out with financing and plenty of information regarding a potential purchase still to be shared.
|
||||
|
||||
There's no guarantee the process is done by October, which means the Winter League Meeting in December could be the time for Brady to finally and firmly close the door on playing in the NFL.
|
||||
Cruise.com
|
||||
Cruise ships are large passenger ships used mainly for vacationing. Unlike ocean liners, which are used for transport, cruise ships typically embark on round-trip voyages to various ports of call, where passengers may go on tours known as "shore excursions".
|
||||
<last-updated>July 18, 2000</last-updated></body>
|
||||
</html>
|
||||
@@ -1,18 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>Wikipedia</title>
|
||||
<meta name="description" content="Tom and Jerry">
|
||||
</head>
|
||||
<body>
|
||||
<h1>html_files/file2.html</h1>
|
||||
<p><a href="subdir1/subdir2/file4.html">file4.html</a></p>
|
||||
<p><a href="subdir1/subdir2/file5.html">file5.html</a></p>
|
||||
<p><a href="file1.html">file1.html</a></p>
|
||||
This article is about the animal cartoon series. For the titular characters, see Tom Cat and Jerry Mouse. Tom and Jerry is an American animated media franchise and series of comedy short films created in 1940 by William Hanna and Joseph Barbera. Best known for its 161 theatrical short films by Metro-Goldwyn-Mayer, the series centers on the rivalry between the titular characters of a cat named Tom and a mouse named Jerry. Many shorts also feature several recurring characters.
|
||||
|
||||
In its original run, Hanna and Barbera produced 114 Tom and Jerry shorts for MGM from 1940 to 1958.[1] During this time, they won seven Academy Awards for Best Animated Short Film, tying for first place with Walt Disney's Silly Symphonies with the most awards in the category. After the MGM cartoon studio closed in 1957, MGM revived the series with Gene Deitch directing an additional 13 Tom and Jerry shorts for Rembrandt Films from 1961 to 1962. Tom and Jerry then became the highest-grossing animated short film series of that time, overtaking Looney Tunes. Chuck Jones then produced another 34 shorts with Sib Tower 12 Productions between 1963 and 1967. Five more shorts have been produced since 2001, making a total of 166 shorts.
|
||||
|
||||
A number of spin-offs have been made, including the television series The Tom and Jerry Show (1975), The Tom and Jerry Comedy Show (1980–1982), Tom & Jerry Kids (1990–1993), Tom and Jerry Tales (2006–2008), and The Tom and Jerry Show (2014–2021). The first feature-length film based on the series, Tom and Jerry: The Movie, was released in 1992, and 13 direct-to-video films have been produced since 2002, with a live-action/animated hybrid film released in 2021. A musical adaptation of the series, titled Tom and Jerry: Purr-Chance to Dream, debuted in Japan in 2019 in advance of Tom and Jerry's 80th anniversary.
|
||||
<last-updated>February 21, 2011</last-updated></body>
|
||||
</html>
|
||||
@@ -1,35 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>CNN.com</title>
|
||||
<meta name="description" content="CNN: Breaking News, Latest News and Videos">
|
||||
</head>
|
||||
<body>
|
||||
|
||||
Content by Dr. Marty's
|
||||
How dogs cry for help: 3 warning signs to watch
|
||||
IN THE SPOTLIGHT
|
||||
Pete Davidson delivered moving remarks in the opening minutes of 'Saturday Night Live' on October 14, 2023.
|
||||
Pete Davidson made emotional and deeply personal remarks during the ‘SNL’ opening
|
||||
Opinion: Pete Davidson’s perfect plea
|
||||
China’s movie theaters thrive as economic gloom descends. Hollywood is missing out
|
||||
Taylor Swift and Travis Kelce’s whirlwind week caps off with ‘SNL’ cameos – and some hand-holding
|
||||
Taylor Swift has a cameo on ‘SNL’
|
||||
Piper Laurie, Oscar-nominated ‘Carrie’ actress, dies.
|
||||
|
||||
View the latest news and breaking news today at CNN.com. CNN operates as a division of Turner Broadcasting System, which is a subsidiary of Warner Media.
|
||||
PODCASTS
|
||||
One Thing_Social Card - 1600x900 (2).png
|
||||
<h1>html_files/index.html</h1>
|
||||
<p><a href="file1.html">file1.html</a></p>
|
||||
<p><a href="file2.html">file2.html</a></p>
|
||||
|
||||
The Cable News Network (CNN) is a multinational news channel and website headquartered in Atlanta, Georgia, U.S.[2][3][4] Founded in 1980 by American media proprietor Ted Turner and Reese Schonfeld as a 24-hour cable news channel, and presently owned by the Manhattan-based media conglomerate Warner Bros. Discovery (WBD),[5] CNN was the first television channel to provide 24-hour news coverage and the first all-news television channel in the United States.[6][7][8][9][10]
|
||||
|
||||
As of February 2023, CNN had 80 million television households as subscribers in the US.[11] According to Nielsen, in June 2021 CNN ranked third in viewership among cable news networks, behind Fox News and MSNBC, averaging 580,000 viewers throughout the day, down 49% from a year earlier, amid sharp declines in viewers across all cable news networks.[12] While CNN ranked 14th among all basic cable networks in 2019,[13][14] then jumped to 7th during a major surge for the three largest cable news networks (completing a rankings streak of Fox News at number 5 and MSNBC at number 6 for that year),[15] it settled back to number 11 in 2021[16] and had further declined to number 21 in 2022.[17]
|
||||
|
||||
Globally, CNN programming has aired through CNN International, seen by viewers in over 212 countries and territories;[18] since May 2019, however, the US domestic version has absorbed international news coverage in order to reduce programming costs. The American version, sometimes referred to as CNN (US), is also available in Canada, and some islands in the Caribbean. CNN also broadcasts in India where it is called CNN-News18, and in Japan, where it was first broadcast on CNNj in 2003, with simultaneous translation in Japanese.[19]
|
||||
|
||||
<!-- Custom element for last updated date -->
|
||||
<last-updated>December 07, 2014</last-updated></body>
|
||||
</html>
|
||||
@@ -1,28 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>Sports Illustrated</title>
|
||||
<meta name="description" content="Celtics Hire Former Knicks Coach - Sports Illustrated">
|
||||
</head>
|
||||
<body>
|
||||
<h1>html_files/subdir1/file3.html</h1>
|
||||
<p><a href="../file1.html">file1.html</a></p>
|
||||
<p><a href="subdir2/file6.html">file6.html</a></p>
|
||||
|
||||
The National WWII Museum commemorates the Day That Will Live in Infamy through articles, oral histories, artifacts, and more.
|
||||
Remembering Pearl Harbor: No moment in the history of the United States casts a longer shadow than Pearl Harbor. “Remembering” it has become a national imperative, a patriotic duty for the American people, and reminding us of that duty has become a ritual of media and political discourse—repeated so often and in so many ways that it’s become part of the routine of our communal life.
|
||||
|
||||
- Rob Citino, PhD
|
||||
Pearl Harbor Attack, December 7, 1941
|
||||
|
||||
Celtics Hire Former Knicks Coach.
|
||||
In Jeff Van Gundy, the Boston Celtics added a consultant with 11 years of experience as an NBA head coach. Former New York Knicks head coach Jeff Van Gundy is joining the Boston Celtics in a senior consultant role assisting on all basketball operations matters, not just coaching. The Celtics reportedly considered adding him to Joe Mazzulla's coaching staff back in July.
|
||||
|
||||
Find any Statue for sale - Perfect for your project, from life size to small table top sculptures.
|
||||
The Statue, the story of a monument to the greatest Philadelphian who never lived, from WHYY Digital Studios. Coming in 2023.
|
||||
Liberty Mutual Group is an American diversified global insurer and the sixth-largest property and casualty insurer in the United States.
|
||||
|
||||
Liberty Mutual may be a good choice for customers seeking several types of insurance products and enjoy the flexibility of working both with an agent and through digital self-service. From basic auto and home insurance to life, small business and commercial, Liberty Mutual may be able to meet all your coverage needs in one company.
|
||||
|
||||
<last-updated>October 12, 2007</last-updated></body>
|
||||
</html>
|
||||
@@ -1,52 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>ESPN</title>
|
||||
<meta name="description" content="Boston Celtics 2023-24 NBA Roster">
|
||||
</head>
|
||||
<body>
|
||||
<h1>html_files/subdir1/subdir2/file4.html</h1>
|
||||
<p><a href="../../file1.html">file1.html</a></p>
|
||||
<p><a href="subdir3/subdir5/file14.html">file14.html</a></p>
|
||||
Woj: 'No progress' on a Klay Thompson extension Adrian Wojnarowski and former Warriors GM Bob Myers explain the delicate negotiations with Klay Thompson on a contract extension.
|
||||
Boston Celtics Roster 2023-24 More NBA Teams Team Roster NAME POS AGE HT WT COLLEGE SALARY https://a.espncdn.com/i/headshots/nba/players/full/4397885.png
|
||||
Dalano Banton45 G 23 6' 7" 204 lbs Nebraska $2,019,706
|
||||
https://a.espncdn.com/i/headshots/nba/players/full/4278031.png
|
||||
Oshae Brissett12 SF 25 6' 7" 210 lbs Syracuse $2,165,000
|
||||
https://a.espncdn.com/i/headshots/nba/players/full/3917376.png
|
||||
Jaylen Brown7 SG 26 6' 6" 223 lbs California $31,830,357
|
||||
https://a.espncdn.com/i/headshots/nba/players/full/4576085.png
|
||||
JD Davison20 SG 21 6' 1" 195 lbs Alabama --
|
||||
https://a.espncdn.com/i/headshots/nba/players/full/4066264.png
|
||||
Wenyen Gabriel13 F 26 6' 9" 205 lbs Kentucky --
|
||||
https://a.espncdn.com/i/headshots/nba/players/full/4065804.png
|
||||
Sam Hauser30 SF 25 6' 7" 217 lbs Virginia $1,927,896
|
||||
https://a.espncdn.com/i/headshots/nba/players/full/3995.png
|
||||
Jrue Holiday4 PG 33 6' 4" 205 lbs UCLA $36,861,707
|
||||
https://a.espncdn.com/i/headshots/nba/players/full/3213.png
|
||||
Al Horford42 C 37 6' 9" 240 lbs Florida $10,000,000
|
||||
https://a.espncdn.com/i/headshots/nba/players/full/3064560.png
|
||||
Luke Kornet40 C 28 7' 2" 250 lbs Vanderbilt $2,413,304
|
||||
https://a.espncdn.com/i/headshots/nba/players/full/3133602.png
|
||||
Svi Mykhailiuk50 SG 26 6' 7" 205 lbs Kansas $2,019,706
|
||||
https://a.espncdn.com/i/headshots/nba/players/full/3102531.png
|
||||
Kristaps Porzingis8 C 28 7' 3" 240 lbs -- $36,016,200
|
||||
https://a.espncdn.com/i/headshots/nba/players/full/4066354.png
|
||||
Payton Pritchard11 PG 25 6' 1" 195 lbs Oregon $4,037,278
|
||||
https://a.espncdn.com/i/headshots/nba/players/full/4397424.png
|
||||
Neemias Queta88 C 24 7' 0" 245 lbs Utah State $2,019,706
|
||||
https://a.espncdn.com/i/headshots/nba/players/full/4683023.png
|
||||
Jay Scrubb55 G 23 6' 5" 220 lbs John A. Logan College --
|
||||
https://a.espncdn.com/i/headshots/nba/players/full/4066405.png
|
||||
Lamar Stevens77 PF 26 6' 6" 230 lbs Penn State $1,930,681
|
||||
https://a.espncdn.com/i/headshots/nba/players/full/4701231.png
|
||||
DJ Steward26 G 22 6' 1" 162 lbs Duke --
|
||||
https://a.espncdn.com/i/headshots/nba/players/full/4065648.png
|
||||
Jayson Tatum0 SF 25 6' 8" 210 lbs Duke $32,600,060
|
||||
https://a.espncdn.com/combiner/i?img=/i/headshots/nophoto.png&w=200&h=146
|
||||
Jordan Walsh27 G 19 6' 7" 205 lbs Arkansas $1,119,563
|
||||
https://a.espncdn.com/i/headshots/nba/players/full/3078576.png
|
||||
Derrick White9 PG 29 6' 4" 190 lbs Colorado $18,357,143
|
||||
Coach Joe Mazzulla
|
||||
<last-updated>September 21, 2016</last-updated></body>
|
||||
</html>
|
||||
@@ -1,21 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>Instagram</title>
|
||||
<meta name="description" content="Lady Gaga (@ladygaga) • Instagram photos and videos">
|
||||
</head>
|
||||
<body>
|
||||
Foundation at the Westfield Century City LA Kinder, Braver Together Pop-Up open through Oct 10.
|
||||
56M Followers, 97 Following, 3639 Posts - See Instagram photos and videos from Lady Gaga (@ladygaga). Lady Gaga Jazz & Piano returns to Las Vegas for 12 shows between August 31 and October 5 Sign up now for the Little Monsters pre-sale on http://vegas.
|
||||
|
||||
Lil Eddie - Statue Tribute to all my supporters who shared my song made cover videos, and more! I appreciate and love everyone one of you!
|
||||
The Rocky Statue and the “Rocky Steps” are undeniably two of the most popular attractions in Philadelphia.
|
||||
|
||||
At Liberty University, you can tailor your associate, bachelor’s, master’s, or doctoral degree to match your area of interest while studying at our Central Virginia campus or globally online. Certificates are also available at both the undergraduate and graduate level.
|
||||
|
||||
Join us at Liberty University where you’ll prepare for a successful career while we prepare you for life. With over 700 programs of study, state-of-the-art technology, and a gifted faculty, Liberty will equip you to enter a competitive job market. Learn, develop, and grow at Liberty so you can impact your culture as a Champion for Christ.
|
||||
<h1>html_files/subdir1/subdir2/file5.html</h1>
|
||||
<p><a href="../../file2.html">file2.html</a></p>
|
||||
<p><a href="subdir3/subdir5/file14.html">file14.html</a></p>
|
||||
<last-updated>July 11, 2008</last-updated></body>
|
||||
</html>
|
||||
@@ -1,21 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>StatueOfLibertyTickets.com</title>
|
||||
<meta name="description" content="Statue of Liberty Tickets, Ellis Island Tickets, Statue of Liberty...">
|
||||
</head>
|
||||
<body>
|
||||
<h1>html_files/subdir1/subdir2/file6.html</h1>
|
||||
<p><a href="../file3.html">file3.html</a></p>
|
||||
<p><a href="file7.html">file7.html</a></p>
|
||||
|
||||
Thank you for visiting www.statueoflibertytickets.com. Reservations and E-tickets are genuine for the tour which includes the ferry service. We also provide extended around-the-clock customer care seven days a week, including holidays. This website is not affiliated with any government entity and is not the official box office. Ticket price exceeds face value. We are a private Customer Service and Booking company. Our goal is 100% customer satisfaction. Please share your experience with us at: Customer Care.
|
||||
|
||||
Visit the Statue of Liberty National Monument and Ellis Island Immigration Museum. Ferry
|
||||
service to both Liberty and Ellis Island is the only way to get there. Tours sell out quickly to the Statue of Liberty and Ellis Island, so reserve your tickets now and avoid long lines.
|
||||
|
||||
Stefani Joanne Angelina Germanotta, known professionally as Lady Gaga, is an American singer, songwriter, and actress. She is known for her image reinventions and versatility in the entertainment industry. Gaga began performing as a teenager, singing at open mic nights and acting in school plays.
|
||||
|
||||
Did You Know? There are concession stands on all the boats and also concession stands and gift stores on both Liberty and Ellis Island.
|
||||
<last-updated>September 12, 2012</last-updated></body>
|
||||
</html>
|
||||
@@ -1,22 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>The Kennedy Center</title>
|
||||
<meta name="description" content="Tom Hanks Biography">
|
||||
</head>
|
||||
<body>
|
||||
<h1>html_files/subdir1/subdir2/file7.html</h1>
|
||||
<p><a href="file6.html">file6.html</a></p>
|
||||
<p><a href="subdir3/file8.html">file8.html</a></p>
|
||||
Thomas Jeffrey Hanks was born in Concord, California, to Janet Marylyn (Frager), a hospital worker, and Amos Mefford Hanks, an itinerant cook.
|
||||
(Actor, director, producer, writer, born July 9, 1956 in Concord, California).
|
||||
|
||||
When Reader's Digest did a poll in 2013 to find out who are the 100 Most Trusted People in America, Tom Hanks came out as No. 1. It's no surprise that the much-beloved star of such hits as Big, Forrest Gump, Castaway, and Toy Story is one of the most powerful and well-respected actors in Hollywood. His accessibility and charisma have earned him comparisons with such screen legends as Jimmy Stewart, Cary Grant and Gary Cooper. Tom Hanks is the "everyman" whose portrayals make us reconsider our own moral choices. Watching him on screen, we think about what we would do--and have done. "And that's why we all love you so much," said Oprah in a 2001 interview with Hanks. "We recognize ourselves in your characters."
|
||||
Tom Hanks
|
||||
Sail the High Seas & Explore Top Travel Destinations with Our Best Cruise Vacation Deals.
|
||||
Hawaii Cruises
|
||||
Book Your Next Adventure To The Beautiful Hawaiian Islands.
|
||||
Caribbean Cruises
|
||||
Hundreds of Caribbean Cruise Deals To Choose From. Book Yours Today!
|
||||
<last-updated>November 20, 2007</last-updated></body>
|
||||
</html>
|
||||
@@ -1,32 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>Amazon</title>
|
||||
<meta name="description" content="Watch Tom and Jerry: The Complete First Volume - Amazon">
|
||||
</head>
|
||||
<body>
|
||||
S1 E15 - The Lonesome Mouse
|
||||
|
||||
May 21, 1943
|
||||
8min
|
||||
ALL.
|
||||
In this decades-old rivalry, Tom Cat and Jerry Mouse match wits against each other in numerous situations and settings. Jerry is bored and lonely after Tom gets booted from the house so he decides to get him back inside.
|
||||
Available to buy
|
||||
S1 E16 - The Flying Sorceress
|
||||
|
||||
January 11, 1952
|
||||
7min
|
||||
ALL
|
||||
Tom steals a witch's flying broom so he can scare the wits out of Jerry.
|
||||
Available to buy
|
||||
<h1>html_files/subdir1/subdir2/subdir3/file10.html</h1>
|
||||
<p><a href="subdir4/file11.html">file11.html</a></p>
|
||||
<p><a href="file8.html">file8.html</a></p>
|
||||
S1 E17 - Johann Mouse
|
||||
A narrator tells the story of a Waltzing mouse named Johann (Jerry) lived in Vienna in Johann Strauss home. Tom try to catch the mouse each time he's dancing but a day in absence of Johann Strauss, Tom decided to learn to playing piano for catch him. S1 E18 - Tom and Jerry in the Hollywood Bowl September 15, 1950 7min. ALL Jerry wants to help Tom out as a conductor at the Hollywood Bowl but the cat is having none of it.
|
||||
Available to buy
|
||||
|
||||
What’s Included in the Price of a Disney Cruise
|
||||
From the moment you step on board to your last wave goodbye, a Disney cruise includes almost everything you could hope for—larger-than-life entertainment, outstanding dining, spacious accommodations, dazzling destinations and more—with most experiences included in the price of your cruise.
|
||||
<last-updated>July 16, 2015</last-updated></body>
|
||||
</html>
|
||||
@@ -1,24 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>Warner Bros.</title>
|
||||
<meta name="description" content="Tom & Jerry | Movies">
|
||||
</head>
|
||||
<body>
|
||||
ABOUT
|
||||
In Theaters and HBO Max on February 26, 2021. “Tom and Jerry” is one of the leading franchises in the Warner Bros. One of the most beloved rivalries in history is reignited when Jerry moves into New York City’s finest hotel on the eve of “the wedding of the century,” forcing the event’s desperate planner to hire Tom to get rid of him, in director Tim Story’s “Tom and Jerry.” The ensuing cat and mouse battle threatens to destroy her career, the wedding and possibly the hotel itself. But soon, an even bigger problem arises: a diabolically ambitious staffer conspiring against all three of them.
|
||||
|
||||
An eye-popping blend of classic animation and live action, Tom and Jerry’s new adventure stakes new ground for the iconic characters and forces them to do the unthinkable… work together to save the day.
|
||||
|
||||
"Tom and Jerry” stars Chloë Grace Moretz (“Neighbors 2: Sorority Rising,” “The Addams Family”), Michael Peña (“Cesar Chavez,” “American Hustle,” “Ant-Man”), Colin Jost (“How to be Single,” “Saturday Night Live”), Rob Delaney (“Deadpool 2,” “Fast & Furious Presents: Hobbs & Shaw”), Pallavi Sharda (“Lion”), Jordan Bolger (“Peaky Blinders”), Patsy Ferran (“Darkest Hour”), Nicky Jam (TV’s “Nicky Jam: El Ganador”), Bobby Cannavale (“The Irishman,” “Ant-Man and the Wasp”), Lil Rel Howery (upcoming “Judas and the Black Messiah,” “The Angry Birds Movie 2”), and Ken Jeong (“Crazy Rich Asians,” “The Hangover,” “Transformers: Dark of the Moon”).
|
||||
|
||||
The film is directed by Tim Story (“Fantastic Four,” “Think Like a Man,” “Barbershop”) and produced by Chris DeFaria (“The LEGO Movie 2,” “Ready Player One,” “Gravity”). It is written by Kevin Costello, based on characters created by William Hanna and Joseph Barbera. Serving as executive producers are Tim Story, Adam Goodman, Steven Harding, Sam Register, Jesse Ehrman, and Allison Abbate. The creative filmmaking team includes director of photography Alan Stewart, production designer James Hambidge, editor Peter S. Elliot, and costume designer Alison McCosh. The music is composed by Christopher Lennertz.
|
||||
|
||||
A Warner Bros. Pictures and Warner Animation Group presentation, a Tim Story Film, “Tom and Jerry” is slated for release in theaters nationwide on February 26, 2021 and will be available in the U.S. on HBO Max for 31 days from theatrical release. The film will be distributed worldwide by Warner Bros. Pictures.
|
||||
|
||||
The film is rated PG for cartoon violence, rude humor and brief language.
|
||||
<h1>html_files/subdir1/subdir2/subdir3/file8.html</h1>
|
||||
<p><a href="file9.html">file9.html</a></p>
|
||||
<p><a href="file10.html">file10.html</a></p>
|
||||
<last-updated>June 17, 2010</last-updated></body>
|
||||
</html>
|
||||
@@ -1,30 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>CBS Sports</title>
|
||||
<meta name="description" content="Tom Brady on coming out of retirement again: 'I wouldn't be...">
|
||||
</head>
|
||||
<body>
|
||||
<h1>html_files/subdir1/subdir2/subdir3/file9.html</h1>
|
||||
<p><a href="file10.html">file10.html</a></p>
|
||||
<p><a href="subdir4/file11.html">file11.html</a></p>
|
||||
Hanna Rose Hall (born July 9, 1984) is an American actress and assistant director, known for her roles as Young Jenny in Forrest Gump (1994).
|
||||
Tom Brady Tom Brady on coming out of retirement again: 'I wouldn't be around by tonight'. Brady joked that he isn't going to unretire for a second time.
|
||||
Tom Ince is ready to shut out the background noise and make a big impression at Watford after enduring one of the toughest seasons of his career.
|
||||
Tom Hanks, After graduating high school in Oakland, Hanks attended junior college before transferring into the theater program at California State University. He spent his summers acting and working at the Great Lakes Shakespeare Festival in Ohio until 1980 when he dropped out of college and moved to New York City. That same year, Hanks was cast as Kip Wilson on the sitcom Bosom Buddies, one of two advertising executives who dress in drag in order to rent an apartment in an all-female building. The exposure led to guest roles on Happy Days, Taxi, The Love Boat, and Family Ties.
|
||||
Tom Cruise, Despite appearing in a whole range of movies from romance to dramas, there's one genre that Tom Cruise dominates in - action films.
|
||||
Whether it's playing Ethan Hunt, an agent of the Impossible Missions Force or US navy pilot LT Pete 'Maverick' Mitchell in Top Gun, the actor can usually found in the midst of a high octaine situation or fight scene.
|
||||
|
||||
However, Cruise has an unexpected clause in his movie contracts that most people don't know about.
|
||||
|
||||
Hanna (she/her) has been working professionally in the film industry for over 20 years. She won the Young Actors Award for her performance in Forrest Gump.
|
||||
Chaotic cat-and-mouse duo Tom and Jerry are up to their old tricks — setting traps and causing trouble — in this reboot of the classic franchise.
|
||||
The series features comic fights between an iconic set of adversaries, a house cat (Tom) and a mouse (Jerry). The plots of each short usually center on Tom's numerous attempts to capture Jerry and the mayhem and destruction that follows. Tom rarely succeeds in catching Jerry, mainly because of Jerry's cleverness, cunning abilities, and luck. However, on several occasions, they have displayed genuine friendship and concern for each other's well-being. At other times, the pair set aside their rivalry in order to pursue a common goal, such as when a baby escapes the watch of a negligent babysitter, causing Tom and Jerry to pursue the baby and keep it away from danger, in the shorts Busy Buddies and Tot Watchers respectively. Despite their endless attacks on one another, they have saved each other's lives every time they were truly in danger, except in The Two Mouseketeers, which features an uncharacteristically morbid ending, and Blue Cat Blues, where both sit on a railroad track at the end after being jilted by girlfriends. The cartoon irises out with the whistle of an oncoming steam train.
|
||||
|
||||
The cartoons are known for some of the most violent cartoon gags ever devised in theatrical animation: Tom may use axes, hammers, firearms, firecrackers, explosives, traps and poison to kill Jerry. On the other hand, Jerry's methods of retaliation are far more violent, with frequent success, including slicing Tom in half, decapitating him, shutting his head or fingers in a window or a door, stuffing Tom's tail in a waffle iron or a mangle, kicking him into a refrigerator, getting him electrocuted, pounding him with a mace, club or mallet, letting a tree or electric pole drive him into the ground, sticking matches into his feet and lighting them, tying him to a firework and setting it off, and so on.[2] While Tom and Jerry has often been criticized as excessively violent, there is no blood or gore in any scene.[3]: 42 [4]: 134
|
||||
|
||||
Music plays a very important part in the shorts, emphasizing the action, filling in for traditional sound effects, and lending emotion to the scenes. Musical director Scott Bradley created complex scores that combined elements of jazz, classical, and pop music; Bradley also often used contemporary pop songs and songs from other films, including MGM films like The Wizard of Oz and Meet Me in St. Louis, which both starred Judy Garland in a leading role.
|
||||
|
||||
Generally, there is little dialogue as Tom and Jerry almost never speak; however, minor characters are not similarly limited, and the two lead characters do speak English on rare occasions. For example, the character Mammy Two Shoes has lines in nearly every cartoon in which she appears. Most of the vocal effects used for Tom and Jerry are their high-pitched laughs and gasping screams.
|
||||
<last-updated>October 27, 2011</last-updated></body>
|
||||
</html>
|
||||
@@ -1,18 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>Twitter</title>
|
||||
<meta name="description" content="CNN Breaking News (@cnnbrk) / X">
|
||||
</head>
|
||||
<body>
|
||||
The meaning of LADY is a woman having proprietary rights or authority especially as a feudal superior. How to use lady in a sentence.
|
||||
The word lady is a term for a girl or woman, with various connotations.
|
||||
Lady definition, a woman who is refined, polite, and well-spoken: She may be poor and have little education, but she's a real lady. See more.
|
||||
Landlord charged with hate crimes after fatally stabbing a 6-year-old and wounding his mom because they were Muslim, authorities say.
|
||||
Breaking news from CNN Digital. Now 64M strong. Check. @cnn. for all things CNN, breaking and more. Download the app for custom alerts: http://cnn.com/apps.
|
||||
Lady Gaga is off the deep end with her rumored new beau Michael Polansky. Here's everything to know about the tech investor romancing Gaga ...
|
||||
<h1>html_files/subdir1/subdir2/subdir3/subdir4/file11.html</h1>
|
||||
<p><a href="file12.html">file12.html</a></p>
|
||||
<p><a href="file13.html">file13.html</a></p>
|
||||
<last-updated>February 12, 2002</last-updated></body>
|
||||
</html>
|
||||
@@ -1,15 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>Apple</title>
|
||||
<meta name="description" content="CNN: Breaking US & World News 12+ - App Store">
|
||||
</head>
|
||||
<body>
|
||||
Introducing iPhone 15 Pro, iPhone 15, Apple Watch Series 9 and Apple Watch Ultra 2. Get $250-$650 off a new iPhone 15 when you trade in an iPhone 11 or higher. Terms Apply. Apple Trade In. Chat for shopping help. Free shipping or pickup. Payment options available.
|
||||
Stay informed on breaking news with the global newsgathering power of CNN. CNN has correspondents and bureaus reporting from across the world to keep you
|
||||
updated and informed on what matters most.
|
||||
<h1>html_files/subdir1/subdir2/subdir3/subdir4/file12.html</h1>
|
||||
<p><a href="file11.html">file11.html</a></p>
|
||||
<p><a href="../subdir5/subdir6/subdir7/file17.html">file17.html</a></p>
|
||||
<last-updated>April 07, 2017</last-updated></body>
|
||||
</html>
|
||||
@@ -1,41 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>Ellis Island Foundation</title>
|
||||
<meta name="description" content="Liberty 135 | Statue of Liberty & Ellis Island">
|
||||
</head>
|
||||
<body>
|
||||
<h1>html_files/subdir1/subdir2/subdir3/subdir4/file13.html</h1>
|
||||
<p><a href="file11.html">file11.html</a></p>
|
||||
<p><a href="../subdir5/subdir6/subdir7/file17.html">file17.html</a></p>
|
||||
|
||||
Upon accepting the Statue from France, the U.S. committed to paying for the pedestal’s construction. The project’s organizers anticipated support from New York’s many titans of industry or the U.S. government. However, neither heeded the call. In 1885, as the Statue sailed to America, the pedestal was incomplete.
|
||||
|
||||
Hungarian immigrant and newspaper publisher Joseph Pulitzer saw this as an opportunity. “[It would be} an irrevocable disgrace to New York City and the American Republic to have France send us this splendid gift without our having provided even so much as a landing place for it,” he proclaimed.
|
||||
|
||||
Pulitzer inspired small donations from millions of Americans, printing in his newspaper, The World, the names of every contributor. His plan was a huge success, raising the remaining funds in a matter of weeks.
|
||||
|
||||
Since 1982, The Statue of Liberty-Ellis Island Foundation has partnered with the
|
||||
National Park Service to care for one of the world’s most beloved symbols-Liberty Enlightening the World.
|
||||
|
||||
Some exciting activities were initiated in conjunction with her 135th birthday (October 28):
|
||||
|
||||
The Foundation was invited to participate in “Honor Your Hometown,” an initiative by documentary producer Ken Burns featuring short videos from an extraordinary group of entertainers; representatives of the nation’s most iconic historic sites and museums; noted civic leaders; and many more.
|
||||
|
||||
In 1982, The Statue of Liberty-Ellis Island Foundation was established to raise funds for and oversee the Statue’s historic centennial restoration. Once again, Americans rose to the occasion.
|
||||
|
||||
With individual contributions, purchases of Liberty commemoratives, and corporate donations, the Foundation raised the needed funds, completing the restoration without any government funding.
|
||||
|
||||
The nation celebrated together over Liberty Weekend in July 1986. Ships of all sizes gathered in New York Harbor, concerts marked the milestone, and unprecedented fireworks lit the sky in honor of our cherished statue.
|
||||
|
||||
With ongoing support from our generous donors, the Foundation has continued pursuing its mission of preserving this beacon of hope. Contributions have supported hundreds of projects large and small, implemented in collaboration with our National Park Service partner.
|
||||
|
||||
Our latest major initiative is the Statue of Liberty Museum, which opened in May 2019. Millions of dollars in donations, commemorated with a series of collectible medallions, made this award-winning museum a reality.
|
||||
|
||||
Harkening back to Pulitzer’s campaign, the Foundation conducted its first crowdfunding campaign and added the name of every donor giving at $18.86 to the museum’s “Founders Registry.”
|
||||
|
||||
In 1865, Édouard de Laboulaye, a French political intellectual and activist, proposed gifting the United States with a statue honoring the centennial of America’s independence, the abolition of slavery, and the long-standing friendship between the two nations.
|
||||
|
||||
French sculptor Auguste Bartholdi supported Laboulaye’s idea, and in 1870 began designing the globally recognized statue, “Liberty Enlightening the World.”
|
||||
<last-updated>February 05, 2010</last-updated></body>
|
||||
</html>
|
||||
@@ -1,34 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>Google Play</title>
|
||||
<meta name="description" content="CNN Breaking US & World News - Apps on Google Play">
|
||||
</head>
|
||||
<body>
|
||||
<h1>html_files/subdir1/subdir2/subdir3/subdir5/file14.html</h1>
|
||||
<p><a href="file15.html">file15.html</a></p>
|
||||
<p><a href="file16.html">file16.html</a></p>
|
||||
Data safety
|
||||
|
||||
Safety starts with understanding how developers collect and share your data. Data privacy and security practices may vary based on your use, region, and age. The developer provided this information and may update it over time.
|
||||
CNN has correspondents and bureaus reporting from across the world to keep you updated and informed on what matters most.
|
||||
|
||||
Never miss a breaking news story or important update again. Personalize your experience and choose the news you want by category: Politics, Business, Health, Entertainment and more. Get the news you want, the way you want.
|
||||
|
||||
Stay informed with CNN:
|
||||
• Get daily news, in-depth reporting, expert commentary and more.
|
||||
• Read articles and save them for later.
|
||||
• Set custom alerts and notifications for news updates.
|
||||
• Listen to CNN Live audio.
|
||||
• Watch exclusive news programs from the anchors you trust.
|
||||
• U.S. and Canada: Watch CNN, CNN International and HLN live coverage with your TV provider.
|
||||
• U.S. and Canada: Watch CNN Original Series and Films on-demand with your TV provider.
|
||||
|
||||
Enjoy a 10-minute preview of live CNN coverage each day. Log in with your TV provider for unlimited viewing.
|
||||
|
||||
Data rates may apply.
|
||||
|
||||
CNN Privacy Policy: https://www.cnn.com/privacy
|
||||
For California residents only, manage sharing your data at: https://www.warnermediaprivacy.com/do-not-sell
|
||||
<last-updated>May 25, 2016</last-updated></body>
|
||||
</html>
|
||||
@@ -1,13 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>Apple Music</title>
|
||||
<meta name="description" content="Lady Gaga">
|
||||
</head>
|
||||
<body>
|
||||
Some may dismiss pop as inauthentic. But for Lady Gaga—one of popular culture's greatest, most extravagant creations—the inauthenticity is the point.
|
||||
<h1>html_files/subdir1/subdir2/subdir3/subdir5/file15.html</h1>
|
||||
<p><a href="file16.html">file16.html</a></p>
|
||||
<p><a href="subdir6/subdir7/file17.html">file17.html</a></p>
|
||||
<last-updated>April 03, 2014</last-updated></body>
|
||||
</html>
|
||||
@@ -1,18 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>Wikipedia</title>
|
||||
<meta name="description" content="Tom Cruise - Wikipedia">
|
||||
</head>
|
||||
<body>
|
||||
<h1>html_files/subdir1/subdir2/subdir3/subdir5/file16.html</h1>
|
||||
<p><a href="subdir6/subdir7/file17.html">file17.html</a></p>
|
||||
<p><a href="file14.html">file14.html</a></p>
|
||||
|
||||
Actor. Producer. Running in movies since 1981. Worldwide.
|
||||
Thomas Cruise Mapother IV (born July 3, 1962), known professionally as Tom Cruise, is an American actor. One of the world's highest-paid actors, ...
|
||||
Tom Cruise
|
||||
Tom Brady
|
||||
Cruise ships are large passenger ships used mainly for vacationing. Unlike ocean liners, which are used for transport, cruise ships typically embark on round-trip voyages to various ports of call, where passengers may go on tours known as "shore excursions".
|
||||
<last-updated>September 14, 2019</last-updated></body>
|
||||
</html>
|
||||
@@ -1,21 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>Bleacher Report</title>
|
||||
<meta name="description" content="Boston Celtics - NBA">
|
||||
</head>
|
||||
<body>
|
||||
In the past 3 regular seasons, only 8 players have signed an extension during the regular season.
|
||||
What Boston cares about right now: Get breaking updates on news, sports, and weather. Local alerts, things to do, and more on Boston.com.
|
||||
Blazers starting Ant, Scoot, Mattise, Jerami and Moses. Ayton, Brogdon and Williams are all out.
|
||||
What Sam Cassell and Charles Lee Bring to the Boston Celtics Myles Turner Bojan Bogdanovic Bogdan Bogdanovic Nick Richards Al Horford
|
||||
Dorian Finney-Smtih
|
||||
Patrick Beverley
|
||||
Jrue Holiday
|
||||
|
||||
Only Holiday topped $30M+ in year 1
|
||||
<h1>html_files/subdir1/subdir2/subdir3/subdir5/subdir6/subdir7/file17.html</h1>
|
||||
<p><a href="file18.html">file18.html</a></p>
|
||||
<p><a href="file19.html">file19.html</a></p>
|
||||
<last-updated>December 18, 2012</last-updated></body>
|
||||
</html>
|
||||
@@ -1,20 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>National Geographic Kids</title>
|
||||
<meta name="description" content="Statue of Liberty">
|
||||
</head>
|
||||
<body>
|
||||
It all started at dinner one night near Paris in 1865. A group of Frenchmen were discussing their dictator-like emperor and the democratic government of the U.S. They decided to build a monument to American freedom—and perhaps even strengthen French demands for democracy in their own country. At that dinner was the sculptor Frédéric-Auguste Bartholdi (bar-TOLE-dee). He imagined a statue of a woman holding a torch burning with the light of freedom.
|
||||
|
||||
Turning Bartholdi’s idea into reality took 21 years. French supporters raised money to build the statue, and Americans paid for the pedestal it would stand on. Finally, in 1886, the statue was dedicated.
|
||||
|
||||
On July 4, 1884 France presented the United States with an incredible birthday gift: the
|
||||
Statue of Liberty! Without its pedestal it’s as tall as a 15-story
|
||||
building. She represents the United States. But the world-famous Statue of Liberty standing in New York Harbor was built in France. The statue was presented to the U.S., taken apart, shipped across the Atlantic Ocean in crates, and rebuilt in the U.S. It was France’s gift to the American people.
|
||||
<h1>html_files/subdir1/subdir2/subdir3/subdir5/subdir6/subdir7/file18.html</h1>
|
||||
<p><a href="file17.html">file17.html</a></p>
|
||||
<p><a href="subdir8/file22.html">file22.html</a></p>
|
||||
|
||||
<last-updated>June 26, 2016</last-updated></body>
|
||||
</html>
|
||||
@@ -1,22 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>GRAMMY.com</title>
|
||||
<meta name="description" content="Lady Gaga | Artist">
|
||||
</head>
|
||||
<body>
|
||||
Nominations.
|
||||
All GRAMMY Awards and Nominations for Lady Gaga. View All Nominations For This Artist · Born Stefani Joanne Angelina Germanotta on March 28, 1986, in New York City, New York · Lady Gaga's debut album, 2008's The ...
|
||||
You have to be unique, and different, and shine in your own way."
|
||||
|
||||
Born Stefani Joanne Angelina Germanotta on March 28, 1986, in New York City, New York
|
||||
Lady Gaga's debut album, 2008's The Fame, reached No. 2 on the Billboard 200. The album featured a pair of No. 1 singles: "Just Dance" and "Poker Face."
|
||||
Gaga won her first two GRAMMYs at the 52nd GRAMMY Awards for Best Dance Recording ("Poker Face") and Best Dance/Electronic Album (The Fame). She also made her first performance appearance on the GRAMMY stage that same year, singing a medley that included "Poker Face" and "Speechless" and "Your Song" in a duet with Elton John.
|
||||
Did you know? As a teenager, Lady Gaga honed her singing by working with vocal coach Don Lawrence.
|
||||
Gaga covered "You've Got A Friend" at the 2014 MusiCares Person of the Year tribute gala honoring Carole King. In 2016 the GRAMMY Museum honored her with the Jane Ortner Artist Award for her support for the arts and music education.
|
||||
In 2011 Lady Gaga founded the Born This Way Foundation, an organization committed to supporting the wellness of young people.
|
||||
<h1>html_files/subdir1/subdir2/subdir3/subdir5/subdir6/subdir7/file19.html</h1>
|
||||
<p><a href="file17.html">file17.html</a></p>
|
||||
<p><a href="subdir8/file22.html">file22.html</a></p>
|
||||
<last-updated>November 02, 2004</last-updated></body>
|
||||
</html>
|
||||
@@ -1,18 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>Expedia</title>
|
||||
<meta name="description" content="Top Hotels in Troy, NY from $55">
|
||||
</head>
|
||||
<body>
|
||||
Recent reviews of Troy hotels. Expedia verified reviews ... Gideon Putnam Resort And Spa ... La Quinta Inn & Suites by Wyndham Clifton Park ... The Hotel Saratoga, ...
|
||||
<h1>html_files/subdir1/subdir2/subdir3/subdir5/subdir6/subdir7/subdir8/file20.html</h1>
|
||||
<p><a href="file21.html">file21.html</a></p>
|
||||
<p><a href="file22.html">file22.html</a></p>
|
||||
|
||||
The Raptors 905 and Long Island Nets both made trades on Thursday, acquiring potential rotation players for next season. The Boston Celtics are a National Basketball Association team that plays in the Eastern Conference. The Celtics have won 17 NBA titles over the course of ..
|
||||
Join others and track this artist. Maroon 5 is an American pop rock band that originated in Los Angeles, California, United States. See Instagram photos and videos from Maroon 5 (@maroon5).
|
||||
Renamed Maroon 5, the musicians in 2002 released an album, Songs About Jane, that showcased its new funky sound. The LP was not an immediate hit ...
|
||||
Heartfelt performances by the likes of Lady Gaga and Rihanna came alongside disappointments for the year's biggest music-themed movies. by: Matthew Strauss.
|
||||
<last-updated>January 02, 2015</last-updated></body>
|
||||
</html>
|
||||
@@ -1,14 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>Youtube</title>
|
||||
<meta name="description" content="Maroon 5">
|
||||
</head>
|
||||
<body>
|
||||
Celebrate the end of 2023 with us in the Bahamas! Tickets on sale now.
|
||||
Welcome to the Maroon 5 official Youtube Channel. Head to our website for more info- www.maroon5.com · maroon5.lnk.to/MiddleGroundand 5 more links.
|
||||
<h1>html_files/subdir1/subdir2/subdir3/subdir5/subdir6/subdir7/subdir8/file21.html</h1>
|
||||
<p><a href="file22.html">file22.html</a></p>
|
||||
<p><a href="file23.html">file23.html</a></p>
|
||||
<last-updated>December 26, 2023</last-updated></body>
|
||||
</html>
|
||||
@@ -1,17 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>Maroon 5</title>
|
||||
<meta name="description" content="Maroon 5: Home">
|
||||
</head>
|
||||
<body>
|
||||
Tour Dates ; Oct 14, 2023. Audacy's 10th Annual We Can Survive. Newark, NJ ; Nov 10, 2023. Amman Exhibition Park. Amman, Jordan ; Nov 13, 2023. Al Dana Ampitheatre.
|
||||
SIGN UP FOR MAROON 5 NEWS. Emails will be sent by or on behalf of Universal Music Group 2220 Colorado Avenue, Santa Monica , CA 90404 (310) 865-4000. You may withdraw your consent at any time. Privacy Policy / Do Not Sell My Personal Information
|
||||
|
||||
<h1>html_files/subdir1/subdir2/subdir3/subdir5/subdir6/subdir7/subdir8/file22.html</h1>
|
||||
<p><a href="file20.html">file20.html</a></p>
|
||||
<p><a href="file23.html">file23.html</a></p>
|
||||
|
||||
We are passionate about music. To help us share that passion we'd like to use cookies and similar technologies to personalize your experiences on our sites and to advertise on other sites. For more information and additional choices click Cookie Choices below.
|
||||
<last-updated>October 02, 2000</last-updated></body>
|
||||
</html>
|
||||
@@ -1,27 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>Screen Rant</title>
|
||||
<meta name="description" content="Tom Cruise's Age In Mission Impossible 7 Is A Shocking ...">
|
||||
</head>
|
||||
<body>
|
||||
<h1>html_files/subdir1/subdir2/subdir3/subdir5/subdir6/subdir7/subdir8/file23.html</h1>
|
||||
Not only does this make many of the movie's stunts and his physical performance all that more special, but comparing it to the original movie makes it truly shocking.
|
||||
Mission: Impossible - Dead Reckoning Part One is another showcase for Tom Cruise's incredible action movie star qualities, but the movie also acts as a shocking reminder of the franchise's longevity thanks to his age. The seventh entry in the Mission: Impossible franchise arrived 27 years after the original movie that rebooted the TV series as an action/thriller movie series. Audiences have spent nearly three decades watching Tom Cruise becoming increasingly more daring with his stunts in an effort to bring more people to theaters and treat them to something they've never seen. It's only become more impressive as he gets older too.
|
||||
<p><a href="file22.html">file22.html</a></p>
|
||||
<p><a href="file21.html">file21.html</a></p>
|
||||
<p><a href="subdir9/file24.html">file24.html</a></p>
|
||||
|
||||
SUMMARY
|
||||
Tom Cruise's age in the Mission: Impossible franchise serves as a shocking reminder of the franchise's longevity, as he continues to perform daring stunts at 59 and 60 years old.
|
||||
Comparing Tom Cruise's age in Mission: Impossible 7 to Jon Voight's age in the original movie highlights Cruise's commitment and the franchise's success.
|
||||
Despite being in his 60s, Tom Cruise shows no signs of slowing down as Ethan Hunt, and he plans to continue playing the character until he is in his 80s, making Mission: Impossible movies even more exciting.
|
||||
|
||||
Boston Celtics rookie Jordan Walsh is already working toward earning the trust of his teammates weeks ahead of the 2023-24 season.
|
||||
|
||||
When Walsh was selected 38th overall in this summer’s NBA draft, the Celtics envisioned Walsh’s greatest asset translating from Arkansas to Boston: Defense, something the C’s desperately need.
|
||||
|
||||
Since the Celtics have a questionable reserve unit tasked with supporting a star-studded core, Walsh has the perfect opportunity to grow as a household name in Boston, and so far, so good for the 19-year-old.
|
||||
|
||||
<last-updated>September 14, 2013</last-updated></body>
|
||||
</html>
|
||||
@@ -1,26 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>Boston.com</title>
|
||||
<meta name="description" content="Boston.com: Local breaking news, sports, weather, and things ...">
|
||||
</head>
|
||||
<body>
|
||||
|
||||
Welcome to the official homepage.
|
||||
News: Stay up to date with Boston.com coverage of News. Sports: Patriots - Red Sox - Celtics - Bruins - New England Revolution - ...
|
||||
Local News: Stay up to date with Boston.com coverage of Local News.
|
||||
Buy Maroon 5 tickets from the official Ticketmaster.com site. Find Maroon 5 tour schedule, concert details, reviews and photos.
|
||||
Maroon 5 discography and songs: Music profile for Maroon 5, formed 2001. Genres: Pop Rock, Pop, Dance-Pop. Albums include Songs About Jane, It Won't Be Soon ...
|
||||
Maroon 5 is an American pop rock band. They were originally formed in 1994 as… Top Scholar. AlexanderJamesM. Popular Maroon 5 songs. Girls Like You (Remix).
|
||||
Pop Rock band from Los Angeles, US. Formed in 1994 as Kara's Flowers, renamed to Maroon 5 in 2001. Members Adam Levine - Vocals/Guitar (1994-)
|
||||
Maroon 5 — and, specifically, its frontman Adam Levine — became the face of blue-eyed soul in the 21st century, managing to navigate shifting trends in mus.
|
||||
Adam Levine Talks Beautiful Mistakes ft. Megan Thee Stallion & Maroon 5's Upcoming Album "Jordi". Zach Sang Show. 661 - Maroon 5 (Adam Levine & James ...
|
||||
Things To Do: Your guide to fun in Boston. Planning your weekend? Sign ...
|
||||
What Boston cares about right now: Get breaking updates on news, sports, and weather. Local alerts, things to do, and more on Boston.com.
|
||||
Here's What Jayson Tatum Told Celtics After Ime Udoka's Removal
|
||||
Right before training camp last year, the Celtics had to figure out how to move forward without Ime Udoka, who had led them on a Finals run.
|
||||
<h1>html_files/subdir1/subdir2/subdir3/subdir5/subdir6/subdir7/subdir8/subdir9/file24.html</h1>
|
||||
<p><a href="../file23.html">file23.html</a></p>
|
||||
<p><a href="file25.html">file25.html</a></p>
|
||||
<last-updated>May 16, 2020</last-updated></body>
|
||||
</html>
|
||||
@@ -1,15 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>Boston University</title>
|
||||
<meta name="description" content="Boston University: Homepage">
|
||||
</head>
|
||||
<body>
|
||||
The official National Hockey League website including news, rosters, stats, schedules, teams, and video.
|
||||
Maroon 5 is known for Love Actually (2003), He's Just Not That Into You (2009) and The Wedding Date (2005). More at IMDbPro.
|
||||
Let us help you plan your vacation or meeting and discover everything Boston MA has to offer. Boston University is a leading private research institution with two primary campuses in the heart of Boston and programs around the world.
|
||||
<h1>html_files/subdir1/subdir2/subdir3/subdir5/subdir6/subdir7/subdir8/subdir9/file25.html</h1>
|
||||
<p><a href="subdir10/file26.html">file26.html</a></p>
|
||||
<p><a href="subdir10/file27.html">file27.html</a></p>
|
||||
<last-updated>July 07, 2008</last-updated></body>
|
||||
</html>
|
||||
@@ -1,23 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>The Vogue</title>
|
||||
<meta name="description" content="Maroon 5">
|
||||
</head>
|
||||
<body>
|
||||
Stay informed on our latest news. Maroon is a brownish crimson color that takes its name from the French word marron, or chestnut.
|
||||
The Port of Los Angeles is America's Port®, the nation's #1 container port and global model for sustainability, security, and social responsibility.
|
||||
Los Angeles magazine is the definitive resource on the people, food, culture, arts & entertainment, fashion, lifestyle, & Los Angeles news.
|
||||
Maroon 5 is an American pop rock band that originated in Los Angeles, California, United States. The group was formed in 1994 as Kara’s Flowers while its members were still in high school and originally consisted of Adam Levine (lead vocals, guitar), Jesse Carmichael (guitar, backing vocals) Mickey Madden (bass guitar) and Ryan Dusick (drums). Kara’s Flowers signed to Reprise Records and released an album, “The Fourth World”, in 1997. After a tepid response to the album, the band parted ways with the record label and the members attended college. Dusick and Madden stayed local at UCLA, while Levine and Carmichael relocated to the east coast to attend Five Towns College. While Levine and Carmichael were in New York, they began to take notice of the urban music surrounding them and later let the style influence the songs they wrote.
|
||||
|
||||
In 2001, the band regrouped, with guitarist James Valentine added to the line-up, and pursued a new direction under the name: Maroon 5. At this point, Jesse Carmichael switched to playing the keyboards and this has become his main instrument in the band (although he also plays rhythm guitar to this day). After these changes, Maroon 5 signed with Octone Records and released their debut album, “Songs About Jane”, in June 2002. The album’s lead single “Harder to Breathe”, received heavy airplay, which helped the album to debut at number six on the Billboard 200 chart. The band won the Grammy Award for Best New Artist in 2005. For the next few years, the band toured extensively worldwide in support of Songs About Jane and produced two live recordings: 2004’s “1.22.03.Acoustic” and 2005’s “Live Friday the 13th”. In 2006, Ryan Dusick officially left the band after suffering from serious wrist and shoulder injuries and was replaced by Matt Flynn. The band recorded their second album, “It Won’t Be Soon Before Long” and released it in May 2007. The album reached number one on the US Billboard 200 chart and the lead single, “Makes Me Wonder”, became the band’s first number-one song on the Billboard Hot 100.
|
||||
|
||||
On September 21, 2010, the band released their third studio album entitled “Hands All Over”. It was re-released in 2011 to include the single “Moves like Jagger” (featuring Christina Aguilera). While the original version of the album received mixed reviews, “Moves like Jagger” reached the number one position on the Billboard Hot 100 chart. The band released their next album, “Overexposed”, on June 26, 2012. All four singles of the album were really successful on the charts – for example, the second single from Overexposed, “One More Night” (released on June 19, 2012), reached number one on the Billboard Hot 100. In 2014, the band signed with Interscope Records and released their fifth studio album, “V”, with the very-first line-up of six official band members, as keyboardist and backing vocalist PJ Morton became an official member in 2012. The album debuted at top of the weekly Billboard 200 chart. Since its debut in 2002, the band has sold more than 15 million albums in the United States and over 100 million singles and 27 million albums worldwide. Read more on Last.fm. User-contributed text is available under the Creative Commons By-SA License; additional terms may apply.
|
||||
|
||||
Similar Artists *NSYNC3 Doors Down30 Seconds To Mars5 Seconds Of SummerAdam Lambert
|
||||
Join our
|
||||
<h1>html_files/subdir1/subdir2/subdir3/subdir5/subdir6/subdir7/subdir8/subdir9/subdir10/file26.html</h1>
|
||||
<p><a href="file27.html">file27.html</a></p>
|
||||
<p><a href="../file24.html">file24.html</a></p>
|
||||
<last-updated>October 13, 2008</last-updated></body>
|
||||
</html>
|
||||
@@ -1,30 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>The New York Times</title>
|
||||
<meta name="description" content="My Impossible Mission to Find Tom Cruise">
|
||||
</head>
|
||||
<body>
|
||||
Celtics And Mavs G League Teams Make A Trade.
|
||||
The Boston Celtics and Dallas Mavericks G League affiliates have made a trade. Pinson's G League rights were traded (on Friday) from the Maine Celtics (G League affiliate of the Boston Celtics) to the Texas Legends (G League affiliate of the Mavs).
|
||||
Official online store of Maroon 5. Fans can purchase exclusive merchandise including t-shirts, sweatshirts, hats and more.
|
||||
The best Maroon 5 songs trace the band's evolution from pop-rock ones-to-watch to a genre-defying act that continues to surprise us.
|
||||
Bustling Los Angeles is home to Hollywood stars, high-fashion shopping, enchanting beach spots and culture. Visit Los Angeles to explore all this and more.
|
||||
Los Angeles' best sights and local secrets from travel experts you can trust.
|
||||
|
||||
<h1>html_files/subdir1/subdir2/subdir3/subdir5/subdir6/subdir7/subdir8/subdir9/subdir10/file27.html</h1>
|
||||
Actress Katie Holmes found fame as a teenager before weathering a steady diet of tabloid storms as an adult.
|
||||
A comprehensive guide and a collection of tips for visiting Los Angeles from the experts at Condé Nast Traveler.
|
||||
Explore Los Angeles by public transit. Start with Metro schedules, maps, arrival times, parking info, trip planning, stations, and fares.
|
||||
Saving wildlife, enriching communities, and creating connections to nature. Advance online reservations are recommended.
|
||||
<p><a href="file26.html">file26.html</a></p>
|
||||
<p><a href="../file25.html">file25.html</a></p>
|
||||
<p><a href="file28.html">file28.html</a></p>
|
||||
Box office data for this person is now available exclusively on IMDbPro.
|
||||
Located in the heart of LA, the Los Angeles Convention Center (LACC) is the preeminent destination for meetings, conventions, and special events.
|
||||
In an interview with Playboy in 2012, Tom Cruise described Katie Holmes as “an extraordinary person” with a “wonderful” clothing line,
|
||||
Tom Cruise has come along way from teen heartthrob.
|
||||
Katie Holmes steps out in $150 flats from a Meghan Markle-loved brand. Katie Holmes is an American actress. She is best known for her role as Joey Potter on the TV series Dawson's Creek, which ran from 1998 to 2003.
|
||||
|
||||
<last-updated>April 07, 2000</last-updated></body>
|
||||
</html>
|
||||
@@ -1,24 +0,0 @@
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>Facebook</title>
|
||||
<meta name="description" content="Boston University">
|
||||
</head>
|
||||
<body>
|
||||
Get up-to-date admissions statistics, SAT scores, student reviews, and more from The Princeton Review.
|
||||
Boston University, Boston, Massachusetts. 387601 likes · 5379 talking about this. Welcome to Boston University. Boston University, Boston, Massachusetts. 387601 likes · 5379 talking about this. Welcome to Boston University.
|
||||
A leading research institution in the heart of #Boston. Follow us to hear about all things BU! : social@bu.edu | Community Guidelines: http://spr.ly/ ..
|
||||
After more than 135 years, continuing to reinvent ourselves to serve you and help businesses thrive in the Los Angeles region.
|
||||
Boston University is one of the top private universities in Boston, United States. It is ranked #=93 in QS World University Rankings 2024. # ...
|
||||
Things to Do in Los Angeles, California: See Tripadvisor's 771082 traveler reviews and photos of Los Angeles tourist attractions. Find what to do today, ...
|
||||
|
||||
Guide to the best hotels and things to do in Los Angeles. Maps, travel tips and more.
|
||||
Check out full game highlights from the Celtics' 123-110 preseason win over the New York Knicks.
|
||||
Kristaps Porzingis, Jrue Holiday and the Celtics looked comfortable in a preseason contest against the Knicks on Tuesday.
|
||||
The Celtics' starters got their regular minutes in Tuesday night's preseason tilt with the Knicks, and they looked ready for Opening Night.
|
||||
Jayson Tatum admits Celtics practices have been ‘hard as (expletive)’.
|
||||
<h1>html_files/subdir1/subdir2/subdir3/subdir5/subdir6/subdir7/subdir8/subdir9/subdir10/file28.html</h1>
|
||||
<p><a href="../file24.html">file24.html</a></p>
|
||||
<p><a href="../file25.html">file25.html</a></p>
|
||||
<last-updated>August 27, 2001</last-updated></body>
|
||||
</html>
|
||||
Binary file not shown. (Before: 48 KiB)
Binary file not shown. (Before: 26 KiB)
Binary file not shown. (Before: 90 KiB)
Binary file not shown. (Before: 4.5 KiB)
Binary file not shown. (Before: 2.4 KiB)
@@ -1,60 +0,0 @@
|
||||
Tom
|
||||
Tom Cruise
|
||||
Tom and Jerry
|
||||
"Tom Brady"
|
||||
Tom Hanks
|
||||
Tomato
|
||||
tomato
|
||||
Statue of Liberty
|
||||
"Statue of Liberty"
|
||||
Pearl Harbor
|
||||
Susan Sarandon
|
||||
Manchester United
|
||||
CNN
|
||||
Boston
|
||||
Boston Celtics
|
||||
"Boston Celtics"
|
||||
Boston University
|
||||
Lady Gaga
|
||||
Maroon 5
|
||||
Los Angeles
|
||||
Facebook
|
||||
LA Lakers
|
||||
Lakers
|
||||
Nicole Kidman
|
||||
Keith Urban
|
||||
Golden State Warriors
|
||||
Splash Brothers
|
||||
Denver Nuggets
|
||||
Nikola Jokic
|
||||
Kobe Bryant Vanessa Bryant
|
||||
Devin Booker Kendall Jenner
|
||||
Keeping up With the Kardashians
|
||||
The Tonight Show Starring Jimmy Fallon
|
||||
Emma Stone
|
||||
Poor Things
|
||||
Everything Everywhere All at Once
|
||||
"Everything Everywhere All at Once"
|
||||
Netflix
|
||||
Academy Awards
|
||||
Grammy Awards
|
||||
James Harden
|
||||
Kawhi Leonard
|
||||
Jude Bellingham
|
||||
Kim Kardashian
|
||||
Chelsea FC
|
||||
Rihanna Stay
|
||||
Rihanna Diamonds
|
||||
Rihanna Umbrella
|
||||
Adele
|
||||
Someone Like You
|
||||
Hometown Glory
|
||||
Prison Break
|
||||
Breaking Bad
|
||||
"Breaking Bad"
|
||||
Anna Gunn
|
||||
Skyler White
|
||||
Bryan Cranston
|
||||
Walter White
|
||||
Aaron Paul
|
||||
Jesse Pinkman
|
||||