Example Proper Name Searches
The following table describes how the ProperNames setting affects the terms that the Content component stores for the sentence Tom Jones And His greatest hits.
| Original | Tom | Jones | And His | greatest | hits | |||
0
|
TOM
|
JONE
|
GREAT
|
HIT
|
||||
1
|
TOM
|
TOMJON
|
JONE
|
GREAT
|
HIT
|
|||
2
|
TOM
|
TOMJON
|
JONE
|
GREAT
|
GREATESTHIT
|
HIT
|
||
3
|
TOM
|
TOMJON
|
JONE
|
ANDHI
|
GREAT
|
HIT
|
||
4
|
TOM
|
TOMJON
|
JONE
|
JONESAND
|
ANDHI
|
GREAT
|
HIT
|
|
5
|
TOM
|
TOMJONES
|
JONE
|
ANDHIS
|
GREAT
|
HIT
|
||
6
|
TOM
|
TOMJONES
|
JONE
|
JONESAND
|
ANDHIS
|
GREAT
|
HIT
|
|
7
|
TOM
|
JONE
|
JONESAND
|
ANDHIS
|
GREAT
|
HIT
|
If the Content component contains these documents, the following queries produce different results according to your ProperNames settings.
| Doc 1: | Doc 2: | |
| Tom Waits and The The in concert with Norah Jones | Tom Jones and the the in concert with Katie Melua |
action=Query&Text=Tom Jones
If you set
ProperNamesto0or7, both documents return with the same relevance (in both cases, the query to Content has the terms TOM and JONE, which match both documents).If you set
ProperNamesto1,2,3,4,5, or6, Doc 2 returns with a higher relevance than Doc 1 (because it matches not just the terms TOM and JONE, but also TOMJON or TOMJONES).action=Query&Text=tom jones
If you set
ProperNamesto0,1,3,4,5,6, or7, both documents return with the same relevance (in both cases, the query to Content has the terms TOM and JONE, which match both documents).If you set
ProperNamesto2, Doc 2 returns with a higher relevance than Doc 1 (because it matches not just the terms TOM and JONE, but also TOMJON).action=Query&Text=The The
If you set
ProperNamesto0,1, or2, the query returns no results (because Content discards both instances of the word The as stop words).If you set
ProperNamesto3,4,5,6, or7, only Doc 1 returns (because in all these cases the query to Content has the term THETH or THETHE, which matches only Doc 1).action=Query&Text=the the
If you set
ProperNamesto0,1,2,3,4,5,6, or7, no results return (because Content discards both instances of the word the as stop words).