Subject. When evaluating enterprises, maximum accuracy and comprehensiveness of analysis are important, although the use of various indicators of organization’s financial condition and external factors provide a sufficiently high accuracy of forecasting. Many researchers are increasingly focusing on the natural language processing to analyze various text sources. This subject is extremely relevant against the needs of companies to quickly and extensively analyze their activities. Objectives. The study aims at exploring the natural language processing methods and sources of textual information about companies that can be used in the analysis, and developing an approach to the analysis of textual information. Methods. The study draws on methods of analysis and synthesis, systematization, formalization, comparative analysis, theoretical and methodological provisions contained in domestic and foreign scientific works on text analysis, including for purposes of company evaluation. Results. I offer and test an approach to using non-numeric indicators for company analysis. The paper presents a unique model, which is created on the basis of existing developments that have shown their effectiveness. I also substantiate the use of this approach to analyze a company’s condition and to include the analysis results in models for overall assessment of the state of companies. Conclusions. The findings improve scientific and practical understanding of techniques for the analysis of companies, the ways of applying text analysis, using machine learning. They can be used to support management decision-making to automate the analysis of their own and other companies in the market, with which they interact.
Altman E.I. Financial Ratios, Discriminant Analysis and the Prediction of Corporate Bankruptcy. The Journal of Finance, 1968, vol. 23, no. 4, pp. 589–609. URL: Link
Gorbachev A.S., Drogovoz P.A. [Forecasting as a tool for advanced development of technological competencies in industry]. Kreativnaya ekonomika = Journal of Creative Economy, 2020, vol. 14, no. 12, pp. 3427–3438. URL: Link (In Russ.)
Drogovoz P.A., Rassomgin A.S. [Review of modern methods of data analysis and their usage for management problem solving]. Ekonomika i predprinimatel'stvo = Journal of Economy and Entrepreneurship, 2017, no. 3, pp. 689–693. (In Russ.)
Luger G.F. Iskusstvennyi intellekt: strategii i metody resheniya slozhnykh problem [Artificial Intelligence. Structures and Strategies for Complex Problem Solving]. Moscow, Vil'yams Publ., 2005, 864 p.
Gorbatkov S.A. et al. Metodologicheskie osnovy razrabotki neirosetevykh modelei ekonomicheskikh ob"ektov v usloviyakh neopredelennosti [Methodological foundations for the development of neural network models of economic objects in conditions of uncertainty]. Moscow, Ekonomicheskaya gazeta Publ., 2012, 494 p.
Hebb D.O. The Organization of Behavior. Wiley, New York, 1949, 335 p.
Callan R. Osnovnye kontseptsii neironnykh setei [The Essence of Neural Networks]. Moscow, Vil'yams Publ., 2001, 288 p.
Gorbachevskaya E.N., Krasnov S.S. [The history of the development of neural networks]. Vestnik Volzhskogo universiteta im. V.N. Tatishcheva = Vestnik of Volzhsky University after V.N. Tatischev, 2015, no. 1, pp. 52–56. URL: Link (In Russ.)
Deboeck G., Kohonen T. Analiz finansovykh dannykh s pomoshch'yu samoorganizuyushchikhsya kart [Visual Explorations in Finance with Self-Organizing Maps]. Moscow, Al'pina Pablisher Publ., 2001, 317 p.
Krasnov M.A. [A method to predict the dynamics of financial time series in investing]. TERRA ECONOMICUS, 2009, vol. 7, no. 1, part 2, pp. 93–98. URL: Link (In Russ.)
Kohonen T. Self-Organizing Maps, NY., Springer-Verlag, Berlin Heidelberg, 2001, 317 p.
Silva B., Marques N. Ubiquitous Self-Organizing Map: Learning Concept-Drifting Data Streams. In: Rocha A., Correia A., Costanzo S., Reis L. (eds) New Contributions in Information Systems and Technologies. Advances in Intelligent Systems and Computing, 2015, vol. 353, Springer, Cham. URL: Link
Zagoruiko N.G., Kutnenko O.A. [Training Dataset Censoring]. Vestnik Tomskogo gosudarstvennogo universiteta. Upravlenie, vychislitel'naya tekhnika i informatika = Tomsk State University Journal of Control and Computer Science, 2013, no. 1, pp. 66–73. URL: Link (In Russ.)
Bishop C.M., Svensen M., Williams C.K.I. Developments of the generative topographic mapping. Neurocomputing, 1998, vol. 21, iss. 1, pp. 203–224.
Hochreiter S., Schmidhuber J. Long Short-Term Memory. Neural Computation, 1997, vol. 9, iss. 8, pp. 1735–1780. URL: Link
Turing A.M. Mozhet li mashina myslit'? (S prilozheniem stat'i Dzh. fon Neimana “Obshchaya i logicheskaya teoriya avtomatov”) [Can the Machine Think? (With the article by J.R. Newman “The General and Logical Theory of Automata”)]. Moscow, Gosudarstvennoe izdatel'stvo fiziko-matematicheskoi literatury Publ., 1960.
Khlopenkova A.Yu., Belov Yu.S. [Methods of natural language processing in voice-controlled assistants]. E-Scio, 2019, no. 11, pp. 167–173. URL: Link (In Russ.)
Yuskov V.S., Barannikova I.V. [Comparison of platforms of natural language processing]. Gornyi informatsionno-analiticheskii byulleten' (nauchno-tekhnicheskii zhurnal) = Mining Informational and Analytical Bulletin (Scientific and Technical Journal), 2017, no. 3, pp. 272–278. URL: Link (In Russ.)
Mai Feng, Tian Shaonan, Lee Chihoon, Ma Ling. Deep Learning Models for Bankruptcy Prediction using Textual Disclosures. European Journal of Operational Research, 2019, vol. 274, iss. 2, pp. 743–758. URL: Link
Fedorova E.A., Khrustova L.E., Demin I.S. [Completeness of non-financial disclosure by Russian companies: The influence on investment attractiveness]. Rossiiskii zhurnal menedzhmenta = Russian Management Journal, 2020, vol. 18, no. 1, pp. 51–72. URL: Link (In Russ.)
Fedorova E.A., Afanas'ev D.O., Nersesyan R.G., Ledyaeva S.V. [Impact of non-financial information on key financial indicators of Russian companies]. Zhurnal novoi ekonomicheskoi assotsiatsii = Journal of the New Economic Association, 2020, no. 2, pp. 73–96. URL: Link (In Russ.)
Grinin I.L. [Developing, testing, and comparing models for sentimental short texts analysis]. Innovatsii i investitsii = Innovation and Investment, 2020, no. 6, pp. 186–189. URL: Link (In Russ.)
Voronov V.I., Martynenko E.V. [Research of parallel structures of neural networks for use in the tasks on the Russian text semantic classification considering limited computing resources (on the example of operational reports used in the RF MIA)]. Ekonomika i kachestvo sistem svyazi = Economics and Quality of Communication Systems, 2018, no. 3, pp. 52–60. URL: Link (In Russ.)
Drogovoz P.A., Koren'kova D.A. [Modern tools for agile management of IT projects and prospects for its improvement using artificial intelligence technologies]. Ekonomika i predprinimatel'stvo = Journal of Economy and Entrepreneurship, 2019, no. 10, pp. 829–833. (In Russ.)
Bikonov D.V., Brazhkin A.A., Sivtsov A.S. et al. [High-level parallel programming system for multicore hybrid processors networks]. Nanoindustriya, 2020, vol. 13, no. S4, pp. 94–96. URL: Link (In Russ.)
Kaftannikov I.L., Parasich A.V. [Problems of training set’s formation in machine learning tasks]. Vestnik Yuzhno-Ural'skogo gosudarstvennogo universiteta. Ser.: Komp'yuternye tekhnologii, upravlenie, radioelektronika = Bulletin of South Ural State University. Computer Technologies, Automatic Control, Radio Electronics, 2016, vol. 16, no. 3, pp. 15–24. URL: Link (In Russ.)
Rubtsova Yu.V. [Constructing a text corpus for tone classification setting]. Programmnye produkty i sistemy = Software & Systems, 2015, no. 1, pp. 72–78. URL: Link (In Russ.)
Veganzones D., Séverin E. An investigation of bankruptcy prediction in imbalanced datasets. Decision Support Systems, 2018, vol. 112, pp. 111–124. URL: Link
Hochreiter S., Schmidhuber J. Long short-term memory. Neural Computation, 1997, vol. 9, no. 8, pp. 1735–1780. URL: Link
Grechachin V.A. [The issue of text tokenization]. Mezhdunarodnyi nauchno-issledovatel'skii zhurnal = International Research Journal, 2016, no. 6-4, pp. 25–27. URL: Link (In Russ.)