|
ABSTRACTS OF ARTICLES OF THE JOURNAL "INFORMATION TECHNOLOGIES".
No. 9. Vol. 31. 2025
DOI: 10.17587/it.31.485-495
ฮ. I. Zakharova, Ph.D., Associate Professor, Deputy Head of Research Laboratory,
Research Laboratory of Artificial Intelligence, Volga Region State University of Telecommunications and Informatics, Samara, 443090, Russian Federation
Development of Intellectual Information System for Solving Typical Tasks of Text Data Analysis, Processing and Synthesis
Received on 07.12.2024
Accepted on 12.02.2025
We consider the architecture of an intelligent information system (hereinafter IIS), which contains the necessary components for solving typical tasks of analysis, processing and synthesis of text data, as well as means for configuring such components and monitoring their operation. In this case, the main task was to create a modular structure of the system, allowing horizontal scaling by flexible customization of the components and the sequence of their work as part of a single automated data processing pipeline. To describe the system structure we used the "C4" model of three levels: Context, Containers, Components. The 4th level "Code" is not used in this article, as it represents, designing a solution for module level abstractions. As an example for the design we have chosen an information system for collecting data on literary works, processing unstructured texts of such documents, loading the information into the analytical storage and analytical constructions based on the available data. To test the design solutions, a service catalog for data extraction, transformation and loading pipelines, a catalog of interaction contracts between services and a project structure for a containerization and multi-container management tool Docker Compose were developed. Apache Airflow and its REST API for integration with the IIS configuration component was used as an orchestration solution. To configure and optimize the system settings, a diagram of components of the module of own development in Python using the Django framework is given, as well as a description of the logic of its components. In conclusion, the advantage of the used approach over the implementation of ready-made software products with a set of necessary functions is described.
Keywords: information systems, information systems architecture, text data processing, modular system architecture, configuration selection
P. 485-495
Full text on eLIBRARY
References
- Ivanov V. K., Palyukh B. V., Sotnikov A. N. Architecture of intellectual system of information support of innovations in science and education, Programmnye produkty i sistemy, 2013, no. 4, pp. 197202 (in Russian).
- Martin R. Pure Architecture. The Art of Software Development, SPb, Piter, 2020, pp. 2633 (in Russian).
- Barseghyan A. A. Data and Process Analysis: textbook, SPb., BHV-Peterburg, 2018, pp. 3641 (in Russian).
- Berson A., Dubov L. Master data management, M., Publishing House "Noosphere", 2017, pp. 103105 (in Russian).
- Harenslak B., de Ruyter J. Apache Airflow and data processing pipelines, Moscow, DMK Press, 2021, pp. 346347 (in Russian).
- Schoenig G.-J. PostgreSQL 11. Mastery of development, Moscow, DMK Press, 2019, pp. 2526 (in Russian).
- Osipov D. L. Technologies of database design, Moscow, DMK Press, 2019, pp. 467468 (in Russian).
- Anderson K. Analytical Culture. From data collection to business results, Moscow, Mann, Ivanov & Ferber, 2017, pp. 81 (in Russian).
- Forcier J., Bissex P., Chan W. Django. Development of Web Applications in Python, SPb., Symbol Plus, 2009, pp. 170172 (in Russian).
- Arsky Y. M., Finn W. K. Principles of the intelligent systems design, ITiVS, 2008, no. 4, pp. 437 (in Russian).
- Evteeva E. V. Features and signs of information systems intelligence, Vestnik VUiT, 2010, no. 15, (in Russian).
- Vorobyeva I. A., Lazareva M. V. System and activity approaches in the design of intellectual learning system, Gumanitarnye issledovaniya Central'noj Rossii, 2024, no. 2 (31), (in Russian).
- Densmore J. Data Conveyors. Pocket guide, Astana, ALIST, 2024, pp. 2730 (in Russian).
- Grant S. Ingersoll, Thomas S. Morton, Andrew L. Farris Unstructured Text Processing. Retrieval, organization, and maญnipulation, Moscow, DMK Press, 2015, pp. 178179 (in Russian).
- Bolshakova E. I., Vorontsov K. V., Efremova N. E., Kly-shinsky E. S., Lukashevich N. V., Sapin A. S. Automatic natural language text processing and data analysis, Moscow, Publishing house of NIU HSE, 2017, pp. 147148 (in Russian).
- Pivotto J., Brasil B. Launch Prometheus, Moscow, Books.kz, 2023, pp. 5657 (in Russian).
To the contents |
|