Automatic Construction and Sophisticated Querying of Domain-Specific Web Search Engines

Goal The goal of this project is developing an automated system that allows constructing specialized Web portals with sophisticated querying features. In particular, the construction of such a specialized portal requires:

  • crawling all the Web pages that are related to a specific target domain or topic,
  • extracting structured information from these pages (with least possible human intervention) and finally,
  • providing effective and advanced querying features over the whole Web repository.

Sponsor Scientific and Technical Research Council of Turkey - TÜBITAK (Grant No: 105E024)

Duration 2005-2007

Budget 124,560 YTL (~$95,000)

People

Principle Investigator Prof. Özgür Ulusoy
Graduate Students Ismail Sengor Altingovde
Rifat Ozcan
Senior Project Students
 
Esra Küçükoğuz (B.S., 2007)
İnci Durmaz (B.S., 2007)
Begüm Saygeçitli (B.S., 2007)
Tuğba Yıldız (B.S., 2007)
Melek Yüksel (B.S., 2007)
Berk Atikoğlu (B.S., 2007)
Hacı Mehmet Yıldırım (B.S., 2007)
Fatih Boyacı (B.S., 2007)
Pelin Angın (B.S., 2007)
Leman Ak (B.S., 2007)
Süleyman Çetintaş (B.S., 2006)
Hakan Yılmaz (B.S., 2006)
Ozan Özcan Dolu (B.S., 2006)
Akif Boynueğri (B.S., 2006)

Related senior projects

Publications

  1. I. S. Altingovde, E. Demir, F. Can, Ö. Ulusoy, Incremental Cluster-Based Retrieval Using Compressed Cluster-Skipping Inverted Files, ACM Transactions on Information Systems (TOIS), vol.26, no.3, 2008. (pdf copy)

     

  2. I. S. Altingovde, F. Can, Ö. Ulusoy, Efficient Processing of Category-Restricted Queries for Web Directories, 30th European Conference on Information Retrieval (ECIR'08),  Lecture Notes in Computer Science (Springer Verlag), vol.4956, 2008.(pdf copy)

     

  3. R. Özcan, I. S. Altingovde, Ö. Ulusoy, Static Query Result Caching Revisited, 17th International Conference on World Wide Web (WWW'08), Beijing, China, 2008. (pdf copy)

     

  4. I. S. Altingövde, R. Ozcan, S. Cetintas, H.Yilmaz, Ö. Ulusoy, An Automatic Approach to Construct Domain-Specific Web Portals, ACM 16th Conference on Information and Knowledge Management (CIKM’07), Lisbon, Portugal, November 2007.(pdf copy

     

  5. I. S. Altingovde, R. Ozcan, H.C. Ocalan, F. Can, Ö. Ulusoy, Large-Scale Cluster-Based Retrieval Experiments on Turkish Texts, 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'07), Amsterdam, The Netherlands, July 2007. (pdf copy)

     

  6. I. S. Altingovde, F. Can, Ö. Ulusoy, Algorithms for Within-Cluster Searches Using Inverted Files, International Symposium on Computer and Information Sciences (ISCIS'06), Lecture Notes in Computer Science (Springer Verlag), vol.4263, 2006. (pdf copy)