Journal «Нейрокомпьютеры: разработка, применение» (Neurocomputers: Development, Application), No. 1, 2013
Article in this issue:
Technologies for real-time extraction of event information from texts
Authors:
V.D. Solovyev, Doctor of Physical and Mathematical Sciences, Professor, Vice-Rector for Informatization, Higher School of Information Technologies, Kazan (Volga Region) Federal University
Abstract:
Among the various tasks of text processing and information retrieval, Information Extraction stands out as the area that focuses on extracting information in the form of frames describing typical situations and/or entities. The paper briefly characterizes and classifies the approaches to this task, surveys the current application areas of information extraction systems, and reviews qualitative and quantitative methods for evaluating how well such systems perform. It also notes the constraints imposed on the architecture of such systems when texts must be processed in real time.
Pages: 23-30