[Namazu-devel-en] Fw: Search Algorithm used for Keyword Search
dhirajr57 at yahoo.com
Tue Apr 11 12:15:49 JST 2017
I'm doing a literature survey on search engines for my thesis. I want to know the data structure that is used for accessing the keywords list (term list) that typically resides in files. Some RDBMSs and search engine like Xapian use B-Tree for accessing keywords. They seem to augment the performance using sparse index.
Do you use the same technique in your search engine?. Kindly provide me this information as soon as possible.
--- On Sat, 8/4/17, Dhiraj R <dhirajr57 at yahoo.com> wrote:
I'm doing a literature survey on search engines. As Namazu is open source, I think I can get the information required by me.
I assume that your system builds a list of keywords and tags to every keyword the documents where it can be found. My questions are as follows:
1. What is the search algorithm used for searching the list of keywords that your search engine has?. Is it the binary search algorithm or some enhancement of it using perhaps some additional data structures?
2. Are the keywords listed in alphabetical order or in some other order?
3. Does search engine like 'google' use only a binary search or any augmented version of binary search for searching the list of keywords that it maintains?
As I could not get these information from anyone, I request you to kindly provide me the above information as I need them for my thesis work.
More information about the Namazu-devel-en