03-60-569 Semantic Web Project

Home - Design - Demo - Crawler -PageRank Near Duplicate - Search Engine - Images and Stats - FrameworkSimilarity- Contacts

  This is the project page for Jordan Willis and James Reid's Semantic Web Project.  Our project goal was to create a scalable distributed web crawler with a search engine to query the pages crawled, which uses a Page Ranking to prioritize and sort search result while removing near duplicates before indexing and addressing similar search returns.    The Apache Lucene search engine which can be viewed at the Project Search Engine site .

Link to Project Status Report
Link to Final Summary Report
Link to Code and Documentation Package