350 rub
Journal Information-measuring and Control Systems №5 for 2013 г.
Article in number:
Discovering new technological trends in texts collections: hybrid models and data patterns time series analysis
Authors:
V.F. Khoroshevsky
Abstract:
New technological trends identification is one of the most sophisticated, as well as the most important, tasks in the domain of S&T analysis. Nowadays, the leading methodologies within the domain are focused mainly on technological roadmapping, Foresight, data patterns and time series analysis, which is used to specify current and projected trends. The paper presents intelligent tools for trend identification in texts collections with hybrid approach based on the integration of classical statistical methods and the methods of information extraction. Several existing approaches are combined to be used for multilingual text collections of various genres. Ontologies driving text processing as well as documents - characteristic vectors containing multiword terms are used. The results of statistical analysis of document collections are presented in the form of data patterns time series that are analyzed with structural methods of image analysis. OWL representation of extensional part of the trend ontological model is generated.
Pages: 25-34
References

 

  1. Preobrazhenskijj A. B., KHoroshevskijj V. F. Strukturnaja model vosprijatija okruzhajushhejj sredy // Voprosy radioehlektroniki. Ser. «Obshhetekhnicheskaja». Vyp. 13. 1971.
  2. Preobrazhenskijj A. B., Rybina G. V., KHoroshevskijj V. F. MIVOS - mnogocelevaja estestvenno-jazykovaja sistema. Izv. AN SSSR. Tekhnicheskaja kibernetika. 1979. № 6. S. 142 - 151.
  3. KHoroshevskijj V. F. ATNL - jazyk predstavlenija lingvisticheskikh znanijj v estestvenno-jazykovykh sistemakh. // «Voprosy kibernetiki». Vyp. 055. Intellektualnye banki dannykh. Pod red. L. T. Kuzina. M.: Sov. Radio. 1979. S. 158 - 168.
  4. KimYoungho, TianYingshi, JeongYoonjae, JiheeRyu, MyaengSung-HyonAutomaticDiscoveryofTechnologyTrendsfromPatentText. In: Proc, SAC-09, March 8 - 12. 2009. Honolulu, Hawaii. U.S.A. 2009. P. 1480-1487.
  5. Wang Ming-Yeu, Chang Dong-Shang, Kao Chih-Hsi Identifying technology trends for RD planning using TRIZ and text mining. RD Management. 2010. V. 40. № 5. P. 491-509.
  6. Kontostathis A., Galitsky L., Pottenger W. M., Roy S., Phelps D. J. A survey of emerging trend detection in textual data mining. In: Survey of Text Mining. 2003. P. 185-224.
  7. Glance Natalie S., Hurst Matthew, Tomokiyo Takashi BlogPulse: Automated trend discovery for weblogs // WWW 2004. Workshop on the webloging ecosystem: aggregation, analysis and dymanics, ACM. 2004.
  8. Daim T. U., Rueda G., Martin H., & Gerdsri P. Forecasting emerging technologies: Use of bibliometrics and patent analysis. Technological Forecasting & Social Change. 2006. V. 73. № 8. P. 98-1012.
  9. Bagheri S. K., Nilforoushan H., Rezapour M., Rashtchi M. A new approach to Technology Roadmapping in the Open Innovation context: The Case of Membrane Technology for RIPI. Journal of Science & Technology Policy. Spring 2009. V. 2. № 1.
  10. Goorha S., Ungar L. Discovery of Significant Emerging Trends. In: Proc. of KDD-10, July 25-28, 2010. Washington: DC. USA. 2010.
  11. Pantel P., Lin D. A statistical corpus-based term extractor // Lecture Notes in Artificial Intelligence. Springer-Verlag. 2001. P. 36 - 46.
  12. Naoki S., Yuya K., Yoshiyuki T., Katsumori M. Detecting emerging research fronts based on topological measures in citation networks of scientific publications. Technovation, V. 28. Issue 11. November 2008. P. 758-775.
  13. R. Nallpati Semantic language models for topic detection and tracking. In Proceedings of the conference of the North American chapter of the Association for Computational Linguistics on Human Language Technology (HLTNAACL - 03), 2003.
  14. Tomokiyo T., Hurst M.A language model approach to keyphrase extraction. In: Proceedings of the ACL Workshop on Multiword Expressions, 2003.
  15. Yoon B. Park Y. A text mining-based patent network: analytical tool for high-technology trend. Journal of High Technology Management Research.2004. V. 15 (1).
  16. Aleskerov F. T., Gokhberg L. M., Egorova L. G., Mjachin A., Sagieva G. S.Analiz dannykh nauki, obrazovanija i innovacionnojj dejatelnosti s ispolzovaniem metodov analiza patternov. // Preprint WP7/2012/07 [Tekst] / F. T. Aleskerov i dr. Nac. issled. un-t «Vysshaja shkola ehkonomiki». M.: Izd. dom Vysshejj shkoly ehkonomiki, 2012.
  17. KHoroshevskijj V. F. Ob odnom metode semanticheskojj interpretacii patternov dannykh na osnove strukturnogo podkhoda. // Preprint WP7/2012/08 [Tekst] / V. F. KHoroshevskijj. Nac. issled. un-t «Vysshaja shkola ehkonomiki». M.: Izd. dom Vysshejj shkoly ehkonomiki, 2012.
  18. Gartner home page: http://www.gartner.com/technology/research.jsp
  19. Rud V. A., Fursov K. S. Rol statistiki v diskussii o nauchno-tekhnologicheskom i innovacionnom razvitii. // Voprosy ehkonomiki. 2011. № 1. S. 120 - 133.
  20. Efimenko I. V.Gibridnyjj podkhod k vyjavleniju kompleksnykh obektov v oblasti nauchno-tekhnicheskogo prognozirovanija: princip «chernogo jashhika». Vsb. trudovmezhdunar. konf. OSTIS-2013. Minsk. Belarus. 2013.
  21. Efimenko I., Minor S., Starostin A., Drobyazko G., Khoroshevsky V. Generating Semantic Content for the Next Generation Web, Chapter in Monograph «Semantic Web». Publisher IN-TECH, 2009.
  22. Developing Language Processing Components with GATE Version 7 (a User Guide). http://gate.ac.uk/sale/tao
  23. Morfologija. http://company.yandex.ru/technology/mystem/
  24. Witte R., Khamis N., Rilling J. Flexible ontology population from text: The owlexporter. In International Conference on Language Resources and Evaluation (LREC). Valletta. Malta. 05/2010 2010.
  25. Narasimhan R. N. Syntax-directed interpretation of classes of pictures, Comm. ACM, 9, 1966. P. 166 - 173. Rus. per.: Narasimkhan P. Sintaksicheskaja interpretacija klassov izobrazhenijj: V sb. «Avtomaticheskijj analiz slozhnykh izobrazhenijj. Mir, 1969.
  26. Shaw A. C. A formal picture description scheme as a basis for picture processing system, Information and Control. 1969. V. 14. S. 9-52.
  27. Salton G., Buckley C. (1988). Term-weighting approaches in automatic text retrieval. // Information Processing and Management 24 (5): R. 513 ? 523.