Archive for the ‘Improve the Quality of Results’ Category

Improve Information Source Selection

August 13, 2011 Comments off

Web content is made for humans to read/look-at (“eye candy”, advertisements, links to other content, etc.). This poses a challenge for automated readers to identify topical information and context boundaries.

For selected sites (like Wikipedia and the US Patent Office), we have readers with individual “Reading Plans” that focus on the information of interest. For all other links, SIRA reads the complete web page and all links for relevant content. This costs time and can lead to finding less relevant content. One way to focus a read of all the links on a page is to create a separate web page with just the links of interest. Then you can specify that page as the base page for SIRA reading.


Write better Research Statements

August 13, 2011 Comments off

Short clear sentences work so much better. (e.g. “The student of the professor on the balcony is happy.” Is the student or professor “on the balcony”?)

Be sure you want all the combinatorics implied by compound sentences. Compound sentences can generate a large number of simple semantic assertions. For example, “Joe, Jane, and bob bought a car, played the saxophone, did cartwheels, and went home.” Becomes 12 simple sentences:
1.    Jane bought a car.
2.    Jane did cartwheels.
3.    Jane played the saxophone.
4.    Jane went home.
5.    Joe bought a car.
6.    Joe did cartwheels.
7.    Joe played the saxophone.
8.    Joe went home.
9.    Robert bought a car.
10.    Robert did cartwheels.
11.    Robert played the saxophone.
12.    Robert went home.

Now try, “Joe and Jane bought and sold a car and truck.”, which become 8 simple sentences:
1.    Jane bought a car.
2.    Jane bought a truck.
3.    Jane sold a car.
4.    Jane sold a truck.
5.    Joe bought a car.
6.    Joe bought a truck.
7.    Joe sold a car.
8.    Joe sold a truck.

Fewer words are better, for example, “Joe will fly to the store and shop.” which semantically becomes:
1.    Joe will fly to the store. (fly to a place)
2.    Joe will fly to the shop. (fly to a place)
3.    Joe will fly to shop. (fly and then do some shopping)
4.    Joe will shop. (not necessarily connected with flying to the store)

We will post more on this topic later.

When Domain Knowledge is needed to translate the research statement

August 13, 2011 Comments off

In many cases automated domain-aware reading and writing is not enough for a satisfactory Research Assistant. A good Research Assistant often needs to apply additional domain knowledge to translate (expand) a “research request” into semantically equivalent and findable reading objectives (e.g. What do you mean by “similar” and what constitutes evidence that something is “similar”?).

Suppose you have a Statement in your Investigation like, “Candidate must have good communication skills.” You may want to translate that to, “#?# has good communication skills.” This allows for multiple unknown ways of saying “candidate”. What about the multiple ways to indicate “good communication skills”?

Evidence of having “good communication skills” could include: “gave presentation”, “brokered agreement”, “published paper”, or anything that implies the presence of something within the domain (topic) of the research. You may want to expand your investigation to include the statements that imply each investigation statement.

Note: We have built additional tools [not yet released] to support the automation of domain-aware expansion of investigation statements based on “Implication/Evidence Patterns”. Please contact us for more information.