350 rub
Journal Highly available systems №2 for 2011 г.
Article in number:
Categorization of websites for blocking web pages with inappropriate content
Authors:
D.V. Komashinskiy, I.V. Kotenko, A.A. Chechulin
Abstract:
The paper considers in process of development and implementation of the common Web content categorization approach. The approach is focused on the actual problem of Web content categorization whose solving is necessary for protecting users from undesired access to Web resources containing forbidden and unacceptable content. The hierarchical multilayer set of classifiers designed for supporting the common categorization process on the base of URL, text content, tags and links data analysis is described
Pages: 102-106
References
  1. Зозуля Ю.В., Котенко И.В. Блокирование Web-сайтов с неприемлемым содержимым на основании выявления их категорий // РусКрипто-2010. 2010.
  2. Han J., Kamber M. Data Mining: Concepts and Techniques. Elsevier. Morgan Kaufman. 2006.
  3. Cooley R., Mobasher B. and Srivastava J. Web Mining: Information and Pattern Discovery of the World Wide Web // Proceedings of the 9th International Conference on Tools with Artificial Intelligence. 1997.
  4. Qi X., Davison B.D. Web Page Classification: Features and algorithms // ACM Computing Surveys (CSUR). 2009.
  5. Кузнецов Р.Ф. Классификатор веб-страниц на базе SVM-Multiclass // Труды РОМИП. 2006.
  6. Kleinberg J.M., Kumar R., Raghavan P., Rajagopalan S. and Tomkins A.S. The Web as a Graph: Measurements, Models, and Methods // Lecture Notes in Computer Science, Springer. V. 1627. 1999.
  7. Kuncheva L. Combining Pattern Classifiers: Methods and Algorithms. Wiley Interscience. 2004.