Radiotekhnika
Publishing house Radiotekhnika

"Publishing house Radiotekhnika":
scientific and technical literature.
Books and journals of publishing houses: IPRZHR, RS-PRESS, SCIENCE-PRESS


Тел.: +7 (495) 625-9241

 

Efficient processing phrase queries using the combined indexes

Keywords:

I.A. Barankova – Programmer, LLC «ICRC I-Teco» (Moscow)
E-mail: iree-mars@yandex.ru
M.V. Vinogradova – Ph. D. (Eng.), Associate Professor, Department «Information Processing and Control Systems», Bauman Moscow State Technical University
E-mail: vinogradova.m@bmstu.ru
M.V. Chernenky – Associate Professor, Department «Information Processing and Control Systems», Bauman Moscow State Technical University
E-mail: chernen@bmstu.ru


This article describes how to organize the search phrase queries in the index structure the most effective with minimal additional memory consumption. For this purpose approaches the composite index: the part of phrase queries is processed using a standard in-verted index, the most common phrase is processed using a phrase index, the most frequently occurring words is processed using the index the following words. It is shown that the use of composite index for processing phrase queries is significantly reduced time of search.

References:
  1. Xoxlova M.V. E'ksperimental'naya proverka metodov vy'deleniya kollokaczij. Slavica Helsingiensia 34. Instrumentarij rusistiki: korpusny'e podxody'. Xel'sinki. 2008. S. 343−357.
  2. Paynter G.W., Witten I.H., Cunningham S.J., Buchanan G. Scalable browsing for large collections: A case study // Proc. ACM Digital Libraries. ACM Press. New York. San Antonio. California. 2000. P. 215−223.
  3. Bahle D. Efficient phrase querying. Ph.D. thesis. School of Computer Science and Information Technology. RMIT. 2003.
  4. Saraiva P.C., Moura E.S., Ziviani N., Fonseca R., Meira W., Murta C., Ribeiro-Neto B. Rank-preserving two-level caching for scalable search engines // Proc. ACM-SIGIR Int. Conf. on Research and Development in Information Retrieval. Eds: Croft W.B., Harper D.J., Kraft D.H., Zobel J. ACM Press. New Orleans, Louisiana. 2001. P. 51−58.
  5. Heinz S., Zobel J. Practical data structures for managing small sets of strings // Proc. Australasian computer science conf., Melbourne, Australia. 2002. P. 75−84.
  6. Scholer F., Williams H.E., Yiannis J., Zobel J. Compression of inverted indexes for fast query evaluation // Proc. ACM-SIGIR Int. conf. on research and development in information retrieval. Tampere (Finland). August 2002. P. 222−229
  7. Büttcher S.Cormack, Charles L.A. Clarke, Gordon V. Information Retrieval: Implementing and Evaluating Search Engines. MIT Press. 2010. 606 c.
  8. Manning K., Ragxavan P., Shyutcze X. Vvedenie v informaczionny'j poisk: Per. s angl. M.: Vil'yams. 2011. 528 s.
  9. Leont'eva N.N. Avtomaticheskoe ponimanie tekstov: sistemy', modeli, resursy'. M.: Akademiya. 2006. 304 s.
  10. Lande' D.V., Snarskij A.A., Bezsudnov I.V. Internetika. Navigacziya v slozhny'x setyax. Modeli i algoritmy'. M.: Librokom. 2009. 264 s.
  11. Trifanov A.A. Algoritmy' postroeniya invertirovannogo indeksa dlya kollekczii tekstovy'x danny'x // Izvestiya VUZOV. Povolzhskij rajon. Texnicheskie nauki. 2013. № 3(27). S. 52−61.

© Издательство «РАДИОТЕХНИКА», 2004-2017            Тел.: (495) 625-9241                   Designed by [SWAP]Studio