CLIA
Research Areas
- Indian Language Search Engines
- Indian language stemmers
- Transliterated Roman to UTF-8 conversion for Indian language content
- Language Identification on web pages
- Clustering and Categorization CLIA
- Cross language document classification
- Multi language news clustering
- Language specific and domain specific focused crawling
- Page life calculation and Scheduling re-crawls
Projects
-
- WebKhoj - Indian Language Search Engine Technology
- An Indian language web search engine called WebKhoj was developed at SIEL. While general search engines like Google, Yahoo and others can search UTF-8 content, they are unable to search many Indian language sites which are encoded in proprietary encoding. This search engine overcomes this hurdle and also can overcome the agglutinative issues of morphologically rich languages of India.
This search technology is licensed to a few commercial organizations to power real world search engines.
Team
- Ram Bhupal Reddy (Research Engineer)
- Padmini
- Sethu
- Kosuru Pavan
- Srinivas
- Nishant
- Sundeep
- V. V. Chaitanya
- PDSR Sandeep
- N. Krishna Chaitanya
- Charan
- Rohit Bharadwaj
- Kranthi
- Sahiti
