Full-text retrieval fundamental

Build Index

  1. Data source(Documents) ready be index.
  2. Lexcial analysis, language processing, translate to Terms.
  3. Create dictionary, posting index table.
  4. Writen into hard disk, or other space.

Query process

  • A. Input query parse.
  • B. Lexcial analysis, language processing, translate to terms.
  • C. Syntax analysis, translate to a query tree.
  • D. Read hard disk index to memory
  • E. Get every term’s documents list by query tree, get result documents by ‘And’/‘Or’/‘Not’ operations on list.
  • F. Sort result documents by doc relevance.
  • G. Return query result.