Making the Web Look Like a Database
Web Query Languages
- Treat the Web as a graph (pages = nodes, links = edges).
- Query the graph (e.g., Find all pages within one hop of pages with the words “Cars for Sale”).
Wrappers
- Find page of interest.
- Parse page to extract attribute-value pairs and insert them into a database.
- Write parser by hand.
- Use syntactic clues to generate parser semi-automatically.
- Query the database.