Hi,
I want to implement as search program which will be able to search through a given set of documents for a given string...I've heard that indexed searches are the best bet in these cases...(if anyone knows of something better i would appreciate it..)..now if i have a set of index keywords for a given document and all the keywords for the whole set of documents, my search then would be quite simple.The problem however is creating this index..that is, how do i scan a document and list out its keywords.
for eg in the sentence below:
The Velociraptor had a height of 6 feet.
here words like 'velociraptor' , height and '6 feet' would be keywords..
Well i have thought of basically removing the filler words like 'the' 'a' etc...
Is there a better method? Otherwise does anyone have any ideas on how I can refine the above method?
Thanks in advance..
Piyush