Improve Information Source Selection

August 13, 2011

Web content is made for humans to read and look at, so it is full of “eye candy”, advertisements, links to other content, and so on. This makes it hard for automated readers to identify topical information and context boundaries.

For selected sites (such as Wikipedia and the US Patent Office), we have readers with individual “Reading Plans” that focus on the information of interest. For all other links, SIRA reads the complete web page and all of its links for relevant content. This takes time and can turn up less relevant content. One way to focus the read on particular links is to create a separate web page that contains only the links of interest, and then specify that page as the base page for SIRA reading.
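As a rough illustration of building such a links-only page, here is a minimal sketch in Python. It is not part of SIRA: the names `LinkCollector` and `build_links_page` are hypothetical, and selecting links by keyword match on the URL and link text is just one assumed way of deciding which links are "of interest". It uses only the Python standard library and works on HTML you have already downloaded.

```python
from html import escape
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collect (absolute_url, link_text) pairs from an HTML document."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []
        self._href = None   # URL of the <a> element currently open, if any
        self._text = []     # text fragments seen inside that element

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                # Resolve relative links against the page they came from.
                self._href = urljoin(self.base_url, href)
                self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            text = " ".join("".join(self._text).split())
            self.links.append((self._href, text))
            self._href = None


def build_links_page(html, base_url, keywords, title="Links of interest"):
    """Return a small HTML page listing only links that match a keyword.

    Keyword matching is an assumption made for this sketch; a hand-picked
    list of URLs would serve the same purpose.
    """
    collector = LinkCollector(base_url)
    collector.feed(html)
    collector.close()

    wanted = [k.lower() for k in keywords]
    seen = set()
    items = []
    for url, text in collector.links:
        haystack = (url + " " + text).lower()
        if url in seen or not any(k in haystack for k in wanted):
            continue  # skip duplicates and links that match no keyword
        seen.add(url)
        items.append('<li><a href="%s">%s</a></li>'
                     % (escape(url, quote=True), escape(text or url)))

    return ('<!DOCTYPE html>\n<html><head><meta charset="utf-8">'
            '<title>%s</title></head>\n<body>\n<ul>\n%s\n</ul>\n</body></html>\n'
            % (escape(title), "\n".join(items)))
```

Calling `build_links_page(page_html, "http://example.com/index.html", ["solar", "patent"])` returns the filtered page as a string. Save it to a file, put it somewhere SIRA can reach, and give that address as the base page.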