Abstract

The object of research is the fundamental and technical indicators of companies after the release of the earnings report. This study attempts to address the issue of understanding the impact of fundamental and technical analysis indicator dynamics on profits and loss news releases. This research provides an in-depth analysis of stock price forecasting models, focusing on the influence of earning report seasons as catalysts for stock price growth. The study explores the relationship between key financial indicators, including earnings per share (EPS), revenue, and the maximum price observed in the 52-week period of the previous year (MaxW52). A trading algorithm is developed based on the adaptive neurofuzzy inference system (ANFIS). Through a comprehensive analysis of the neural network’s training sample, it is concluded that abnormally large negative indicators have a profound impact on traders’ emotional reactions. This results leads to a hypothesis for further research, suggesting that report indicators may be processed by computational algorithms, potentially including artificial intelligence (AI). Consequently, the emergence of emotional trading robots managed by investment funds becomes a crucial area for investigation. Understanding the behavior of these algorithms enables proactive decision-making, allowing traders to leverage their knowledge and sell-purchased securities to these algorithms before their transactions occur. The implications of this research shed light on the evolving landscape of trading strategies and the role of emotionality in financial markets.

1. Introduction

Making investment profits, diversifying assets, and receiving dividends are the main reasons why companies, banks, funds, investors, and traders buy and sell stocks [1]. The right purchase of stocks can help the investor’s money grow. In the long term, the benefits of investing in stocks tend to far outweigh the benefits of storing money in less profitable assets such as cash or deposits [2]. The variety of stocks makes it possible to distribute risk during trading, as there is a large classification of assets, economic sectors, and geographic regions [3]. Investors can also receive income from the purchase of stocks in the form of regular dividends. They can keep this income or reinvest it.

The price of a stock is determined by the rule of supply and demand; until a reasonable price is reached, it will fluctuate between buyers and sellers. The ability to make a correct price forecast is the most important quality of a good trader [4]. As more people gain access to the stock markets due to the development of Internet technologies, more and more people are interested in finding out if there is any predictability in this market.

This study holds significant relevance in the field of finance, data analysis, natural language processing (NLP), and decision-making. The importance of gaining insight into the financial performance of companies cannot be overstated, making it essential to have an effective and efficient way to analyze earning reports. This research provides a solution to this critical need for investors, financial analysts, and decision-makers alike. Neurofuzzy systems coupled with NLP techniques offer an effective approach to glean valuable knowledge from the vast amounts of textual data in reports [5].

Analyzing business earning reports provides investors and financial experts the insight they need to make the right investment decisions, assess a company’s debt-to-equity ratio, recognize any potential risks, and accurately predict future performance. Reading company earnings reports can be terribly time consuming as they often contain a lot of textual data. Extracting key elements and making correct conclusions can be quite difficult. The growing availability of vast amounts of financial data, coupled with advancements in artificial intelligence and machine learning, presents an unprecedented opportunity to revolutionize the way financial analysis is conducted. By combining the power of neurofuzzy systems with NLP, analysts and researchers can easily automate the analysis process and gain a better understanding of company financial performance. The integration of neurofuzzy systems with NLP enables the automatic extraction, categorization, and understanding of crucial information from unstructured earning reports, thus streamlining the analysis process. This advanced method offers an effective and reliable way to access in-depth insights [6]. Utilizing this method, financial analysis can be more precise and productive which can result in wiser decisions. Therefore, studies that are devoted to the development of a neurofuzzy system for analyzing company earnings reports using NLP are highly relevant in today’s data-driven financial landscape.

NLP algorithms are essential in making sense of unstructured textual data and transforming it into useful, structured information. By making use of NLP algorithms including text parsing, named entity recognition, sentiment analysis, and topic modeling, this research has enabled extracting valuable information from financial statements and company earnings reports [7]. By implementation of certain techniques, it becomes easier to spot key financial indicators, patterns, and discrepancies that can give insights about the financial wellbeing and performance of organizations [8].

Neurofuzzy systems are a combination of artificial neural networks and fuzzy logic that works together to process complex and uncertain data [9]. This allows it to be more accurate than either technique alone. This research makes use of the capabilities of neurofuzzy systems to manage linguistic ambiguities and inexact financial data from earning statements. Neurofuzzy systems offer a convenient and clever way to develop algorithms with higher accuracy in predicting or classifying financial variables and making sense of complex linguistic expressions [10]. These sophisticated AI techniques make it possible to capture the intricate correlations between different data points.

The aim of the study is to create and evaluate a neurofuzzy system that can be utilized for interpreting corporate earnings reports with the application of natural language processing methods.

To achieve this aim, the following objectives are accomplished:(i)It is necessary to design an architecture of an adaptive neurofuzzy system, which will take into account the process of collecting information from Internet resources, the process of processing indicators of fundamental and technical analysis, and the process of obtaining a neurofuzzy conclusion;(ii)It is required to correlate the qualitative indicators of financial statements to filter the training data from unnecessary financial indicators;(iii)To confirm the results of the study and build a financial strategy, it is necessary to conduct a predictive experiment; this experiment will help us to fix the fuzzy inference rules and optimize the trading strategy.

2. Literature Review

Traditionally, price analysis is divided into fundamental and technical analysis [11]. The stock price can be affected by news, events, and information about the release of new products. Information determines the movement of financial markets according to the efficient market hypothesis (EMH) [12]. The EMH model assumes that stock prices are determined by a random walk model. The EMH model does not allow us to predict the market movement. But a number of researchers have refuted this hypothesis. This study serves as a comprehensive evaluation of multiple classifiers in the context of predicting the direction of stock prices, aiming to provide insights into the performance and suitability of various machine learning techniques in this domain [13]. This article explores the relationship between Twitter mood and the Dow Jones Industrial Average (DJIA). The study analyzes Twitter feeds to determine if they are correlated with the value of the DJIA over time. The authors found that mood states, but not sentiment, can improve the accuracy of predictions of the DJIA. The article suggests that public mood state increases prediction accuracy and that emotions and moods play an important role in human decision-making [14]. This paper discusses the use of deep learning networks for stock market analysis and prediction. The study presents a comprehensive big data analytics process to predict the daily return direction of the SPDR S&P 500 ETF based on 60 financial and economic features. The authors explore various deep learning algorithms based on the combination of network structure, activation function, model parameters, and their performance depending on the format of the data representation. The article provides insights into the use of deep neural networks for stock market prediction and highlights the importance of data representation in achieving accurate predictions [15]. The authors of this research [16] propose a hybrid approach that combines the predictions of multiple classifiers to improve the accuracy of stock market predictions. The article provides insights into the use of machine learning techniques for stock market analysis and prediction and highlights the importance of combining multiple classifiers to achieve accurate predictions. It cannot be denied that macroeconomic indicators, seasonal effects, and political events can affect the price of stocks [17]. Emotions and moods also influence the decision to buy or sell stocks [18]. With the development of Internet technologies, the growth of information, and the improvement of methods of obtaining information, investors receive an ever-growing amount of information every day. Social networks have become a new platform for influencing investors’ ideas and decisions. News sites broadcast thousands of news daily that instantly affect the stock market. Twitter, LinkedIn, and Facebook have made ordinary people become the main source of information exchange these days. Knowledge about finance, especially about the capital market, is spread instantly through social networks. The correlation between the behavior of the stock price, investor sentiment, and news is being investigated by many experts in the field of finance and natural language processing. The purpose of these studies is to find a new stock price forecasting tool.

Financial reports, official documents, and corporate online discussions can be used to predict market trends using fundamental analysis [19]. Reports, news, and articles about the company contain a lot of qualitative and quantitative information for fundamental analysis. The fundamental analysis is based on the company’s financial conditions and macroeconomic indicators, such as EBITDA, P/E, profit, return on equity, and dividend yield. Using this information, traders trade stocks with a value above or below their intrinsic value.

As time series of financial indicators have become more accessible and easier to process, researchers in the field of stock price forecasting have shifted their focus to technical analysis [20]. The basis of technical analysis is chart patterns, relative strength index, moving averages (MAs), moving average convergence/divergence (MACD), and money flow index.

In recent years, text mining has been widely used in stock price forecasting. The text is converted into a feature vector [21]. There are many ways to vectorize text, namely, word embedding, the most common word embedding methods are BOW, N-gram TD-IDF, and Word2Vec; deep learning-based methods such as embedding layers in neural networks, using pretrained vectors such as Elmo and BERT; noun phrases; n-gram; topic modeling, which is similar to the clustering of numerical data and is used for classification of documents; emotional words; and the Bag-of-Words model is a disorderly representation of the document text.

This study proposes a knowledge-driven text-embedding model that is trained on over 10 years of annual reports from Russell 3000 firms [22]. The authors show that their proposed method outperforms cutting-edge benchmarks in predicting stock return volatility. This study provides insights into the use of BERT and natural language processing techniques for stock market prediction and highlights the importance of incorporating user sentiments in predicting stock market trends with [23]. This study proposes a deep learning framework that incorporates expert-based investment opinion signals to predict stock prices [24]. The authors of [25] proposed an ontology-based approach to extract word combinations for natural language processing. This research study proposes a method for financial forecasting using character n-gram analysis and readability scores of annual reports [26]. This study proposes a supervised topic model to extract focused topics and sentiment of financial markets for price movement prediction [27]. The study provides insights into the use of dimensional valence-arousal approaches for sentiment analysis in financial markets and highlights the importance of incorporating sentiment information in predicting stock price movements [28].

Behavioral economics explores how emotions significantly influence financial decisions [29]. The optimistic or pessimistic mood of the society is used in data mining or social network analysis. Sentiment assessment is used to understand what emotions people feel when they read news and opinions. Sentiment analysis is mainly applied at the following three levels: document level, sentence level, and aspect level. Sentiment analyzers are most often based on machine learning methods and lexicon approaches. In the lexical approach, the classifier determines the polarity using a dictionary, and the quality of classification depends on the size of the dictionary. Using natural language processing and text analysis methods, it is possible to determine the mood of text content, i.e., positive, negative, or neutral. Each term, word, or phrase that contains emotions is assigned positive and negative meanings. Furthermore, by the sum of these values, it is determined to which category the document belongs. The most commonly used dictionaries are WordNet, SentiWordNet, and SenticNet.

In complex analytical systems, machine learning algorithms are used after nlp-converting of text into numbers. By using SVM algorithms, a naive Bayesian classifier, and the random forest algorithm, the movement of stock prices can be forecasted. DNN, MLP, CNN, RNN, LSTM, and other types of neural networks are also used to predict stock prices based on time series analysis [30]. Moreover, regression models are used to evaluate the stock market. Regression is a random process used to describe specific time-varying processes in nature, economics, etc. The autoregressive model determines that the result depends linearly on its initial values and the stochastic term.

The description of stock returns can be represented by an extended ANFIS [31]. In general, fuzzy logic is effectively used in the building expert systems [32]. ANN and the fuzzy system, or ANFIS, were applied to predict the profitability of financial ratios of Islamic banks in Saudi Arabia [33]. ANFIS and discrete particle swarm optimization were applied to forecasting the closing prices of stock indices in India [34]. When developing such systems, it is important to use high-performance programming languages, for example, Python [35], as well as pay attention to the convenience of the user interface [36].

Data on prices of opening and closing trades or indicators of stock exchange indices are mainly used as a sample for training neural networks [31, 33, 34]. In our study, we tried to solve the problem of interpreting profit and loss news to further predict stock price movement. In our research, we will use EPS data, revenue, 52-week high price, price before the publication of the report, price at the opening of trading, maximum price during the trading day, closing price, analyst expectations, data from the previous financial report, and others. The main novelty of the study is the use of not only a time series of stock price dynamics but also the use of indicators of financial reports and indicators of experts’ expectations in the projected predictive model. In the context of financial markets and stock price prediction using neural networks, most of the researchers [20, 30, 31, 33, 34] focused solely on time series data of price dynamics as the input to train their models. Considering the main aspects of this problem, we can say that using only time series about stock prices does not always lead to good results in research because relying only on the time series of price dynamics, which includes historical stock prices, might have limitations. Price dynamics alone may not capture all the relevant information about a stock’s behavior. Financial markets are influenced by a multitude of factors, including economic indicators, news events, company fundamentals, and investor sentiment. Using only price dynamics might lead to overfitting, where a model becomes too specific to the training data and performs poorly on new, unseen data. Including additional features can help reduce overfitting by providing a more robust representation of the underlying factors influencing stock prices.

3. Materials and Methods

US companies publish their quarterly, semiannual, and annual financial results in certain calendar periods. Brokers call these time periods the reporting season. Quarterly reports are used to conduct fundamental and technical analysis of the company’s financial position. Quarterly reports are used as an indicator of a company’s development trends, and they are also used to predict the company’s market price. The reporting season is one of the most important trading periods as volatility increases at this time. Volatility presents more opportunities for investors and traders to earn money through active speculation. Speculation manifests itself in an increase in the volume of traded stocks, as well as an increase in their range of price movements in general.

Earnings reports, also known as quarterly financial reports or earnings releases, are documents published by publicly traded companies to provide information about their financial performance during a specific period, usually on a quarterly basis. The Nasdaq stock exchange hosts a wide range of companies, and their earnings reports typically contain several key components. The following is an overview of the common elements found in these reports:(1)Financial Statements. (1.1) Income Statement (Profit and Loss Statement). This section outlines the company’s revenues, costs, and profits over a specific period. Key figures include revenue, gross profit, operating income, net income, and EPS. (1.2) Balance Sheet. This provides a snapshot of the company’s financial position at a specific point in time. It includes assets (such as cash, inventory, and property), liabilities (such as debts and obligations), and shareholders’ equity. (1.3) Cash Flow Statement. This document details how changes in the balance sheet and income statements affect cash and cash equivalents. It is divided into operating, investing, and financing activities.(2)Management Discussion and Analysis (MD&A): This is a narrative section where company management provides context and analysis of the financial results. It often includes discussions on market conditions, business strategies, and significant events that impacted performance.(3)Outlook and Forward-Looking Statements: Companies often provide guidance or outlook for future performance. This can include revenue projections, expected expenses, and other factors that may impact future earnings. In addition, companies include disclaimers about uncertainties and risks.(4)Non-GAAP (Generally Accepted Accounting Principles) Financial Measures: In addition to standard accounting metrics, companies may present non-GAAP measures to provide additional insight into their performance. These might exclude certain expenses or income items to present a clearer picture of underlying business trends.(5)Operational Metrics: Depending on the industry, companies may include key operational metrics relevant to their business. For technology companies, this might include user growth or engagement metrics, while a retail company might highlight same-store sales growth.(6)Conference Call and Questions and Answers: Many companies conduct earnings conference calls where executives discuss the results, provide insights, and answer questions from analysts and investors. These calls offer an opportunity for stakeholders to gain a deeper understanding of the company’s performance and future plans.

Speculation during the reporting season most often sets the price trend until the next report date, that is, for the whole quarter. Company executives prepare earning and loss statements. The correlation among the data on sales, income, losses of companies, as well as analysts’ recommendations is most often studied in the reporting season. Companies hold teleconferences on financial and business results in the form of a discussion session, which is attended by analysts, investors, and journalists. A transcript of the recording sessions can be found on the companies’ websites. At these meetings, company leaders present financial achievements and forecast future development plans. Detailed information is also provided on growth, risks, purchases, liabilities, lawsuits, stock repurchase programs, programs to increase or decrease the payment of dividends, in the management teams, and future goals of the company. The most important information is the EPS and revenue indicators, as well as their difference with analysts’ expectations.

Publishing a large amount of data from the transcripts on the Internet may allow us to more accurately forecast the directions of stock price movements. In this regard, from all the catalysts listed in introduction, we have chosen the reporting season.

Our research begins with constructing a hypothesis about how earning reports affect investor sentiment and determine the ability to predict price movement based on data from earnings reports.

In this article, we conduct an extensive study of the possibility to predict the behavior of stock prices using data of EPS and revenue indicators, as well as their difference with analyst expectations, the price approximation to 52-week high until the report publication, price changes for the last quarter and for the same period of the report delivery, and price dynamics at the opening of trading after report publication. EPS is a fundamental metric that reflects a company’s profitability on a per-share basis. It is widely used by investors and analysts to gauge a company’s financial health. A company with growing or beating EPS expectations may be seen as financially sound and could experience positive stock price movements. Revenue is a critical measure of a company’s top-line performance. Examining revenue indicators, such as total revenue and revenue growth, can provide insights into the company’s sales and market position. Strong revenue growth is generally viewed positively by investors. Comparing actual results with analyst expectations provides an indication of how well the company performed relative to market forecasts. A significant difference between reported earnings and what analysts anticipated can lead to stock price movements as it may indicate surprises or deviations from market expectations. The distance of the stock price from its 52-week high can provide insights into the stock’s momentum and potential resistance levels. Investors often look at this metric to assess whether a stock is trading near its peak or has room for further upside. Analyzing the stock price changes over the last quarter provides information on short-term performance trends. Investors may be interested in understanding how the stock has been moving in the recent past to identify potential patterns or trends. The immediate market reaction to earnings reports can be significant. Studying the price dynamics at the opening of trading after the report publication can provide insights into investor sentiment and reactions to the company’s financial results. The data sample was taken randomly and was not tied to a specific time period. In order to reduce the volume of data and obtain more realistic figures of price dynamics, companies were selected by several indicators, primarily by the average trading volume of stocks per day, and this indicator amounts more than 100,000 stocks per day. Another filter for companies was the exclusion from the sample of several sectors of the US economy, such as fintechs, oil and gas industry, insurance, cryptocurrency. Some sectors, such as cryptocurrency, can be highly volatile and sensitive to external factors. Excluding these sectors might be a way to reduce the overall volatility of the sample and make the analysis more stable. Certain sectors, such as finance and cryptocurrency, are subject to specific regulatory environments that can significantly impact their performance. These regulatory changes may not affect other sectors in the same way. By excluding sectors with unique regulatory challenges, we can create a more predictable and comparable dataset. Oil and gas companies are heavily influenced by commodity prices, geopolitical events, and macroeconomic factors. Some sectors have unique financial metrics that are more relevant to their performance. For instance, financial ratios like net interest margin are critical for banks. Excluding sectors where traditional financial metrics may not be as applicable allows for a more targeted analysis using metrics that are relevant across the selected companies.

To solve the abovementioned hypotheses, a functional model will be designed using a neural network, which we will train on the collected data from financial reports. The main goal is to find a correlation between the dynamics of price changes and published report data. We transform the resulting model into a model of production rules. The rule base will serve as the basis for developing a fuzzy expert system for forecasting stock prices.

The financial indicators of the company that we used as a sample for the study are the following:(i)EPS: earnings per share, or an indicator that determines the share of profit generated by the company, which accounts for one ordinary share outstanding [37]. EPS is calculated by dividing the company’s net income by the average number of outstanding shares during a specific period. The formula for EPS is as follows: (ii)AnalyticEPS: EPS expected by analysts [38]. Analysts use their expertise, financial models, and industry knowledge to predict a company’s future earnings and, consequently, its EPS. Analytic EPS is an important metric for investors as it provides insight into the market’s expectations for a company’s financial performance.(iii)DynamicEPS: calculation formula for the difference of the current EPS with analysts’ expectations, (EPS-AnalyticEPS)/AnalyticEPS∗100, indicated in %.(iv)EPS0: EPS for the last quarterly report [39].(v)AnalyticEPS0: EPS expected by analysts for the last quarterly report [38].(vi)DynamicEPS0: calculation formula for the difference of the EPS and analysts’ expectations for the last quarterly report, (EPS0-AnalyticEPS0)/AnalyticEPS0∗100, indicated in %.(vii)Revenue: revenue indicator [40]. Revenue is a key financial indicator that represents the total amount of money generated by a company from its primary business activities, such as sales of goods or services. Also referred to as sales or turnover, revenue is a crucial metric for assessing a company’s financial performance and overall health. The formula for calculating revenue is straightforward: . And, for service-based companies: .(viii)AnalyticRevenue: revenue indicator expected by analysts [41]. Analytic revenue, also known as analyst revenue, refers to the revenue estimate or forecast made by financial analysts who cover a specific company or industry. Similar to Analytic EPS, this metric represents the analysts’ predictions about a company’s future revenue based on their research and analysis.(ix)DynamicRevenue: calculation formula for the difference between the current revenue indicator and analysts’ expectations, (Revenue-AnalyticRevenue)/AnalyticRevenue∗100, indicated in %.(x)Revenue0: revenue indicator for the last quarterly report [40].(xi)AnalyticRevenue0: analysts’ expected revenue for the last quarterly report [41].(xii)DynamicRevenue0: calculation formula for the difference between the current revenue indicator and analysts’ expectations for the last quarterly report, (Revenue0-AnalyticRevenue0)/AnalyticRevenue0∗100, indicated in %.(xiii)MaxW52: the maximum stock price for the last 52 weeks [42]. This metric is commonly used in financial analysis and investment to provide insight into the historical performance of a stock. The maximum stock price over the past 52 weeks can offer information about a stock’s recent trading range and potential trends.(xiv)PriceBeforeOpen: stock price until the market opens after the report publication [42].(xv)СlosenessW52: calculation formula for the difference of the stock price with the price of the one-year maximum used to understand the upper limits of resistance and also used to predict a possible gap in an uptrend, (MaxW52-PriceBeforeOpen)/PriceBeforeOpen∗100, indicated in %.(xvi)PriceOpen: stock price at the market opening, probable purchase price of the stock by the trading robot [43].(xvii)DynamicPriceOpen: calculation formula for the stock price difference at the market opening with the price before the market opening after the report publication, (PriceOpen-PriceBeforeOpen)/PriceBeforeOpen∗100, indicated in %.(xviii)PriceDayTop: the maximum stock price on the day after the report is published; the first approximate selling price of the stock by the trading robot [44].(xxi)DynamicPriceDayTop: calculation formula for the difference of the maximum stock price with the stock price at the market opening on the day after the report publication, (PriceDayTop-PriceOpen)/PriceOpen∗100, indicated in %.(xx)PriceClose: stock price at the close of trading on the day after the report publication; the second approximate selling price of the stock by the trading robot [45].(xxi)DynamicPriceClose: calculation formula for the difference of the stock price at the close of trading with the stock price at the market opening on the day after the report publication, (PriceClose-PriceOpen)/PriceOpen∗100, indicated in %.(xxii)PriceOpen0: stock price at the market opening for the last quarterly report [43].(xxiii)PriceDayTop0: the maximum stock price on the day after the report publication for the last quarterly report, necessary to verify the possibility of the origin of the gap [44].(xxiv)DynamicPriceDayTop0: calculation formula for the difference between the maximum stock price and the stock price at the market opening on the day after the publication of the last quarterly report, (PriceDayTop0-PriceOpen0)/PriceOpen0∗100, indicated in %.(xxv)PriceClose0: stock price at the close of trading on the day after the publication of the last quarterly report, necessary to verify the possibility of the origin of the gap [45].(xxvi)DynamicPriceClose0: calculation formula of the stock price difference at the close of trading with the stock price at the market opening on the day after the publication of the last quarterly report, (PriceClose0-PriceOpen0)/PriceOpen0∗100, indicated in %.

These indicators will be collected from the following sites using nlp scraping methods:(i)https://finance.yahoo.com/quote/CDMO?p=CDMO&.tsrc=fin-srch(ii)https://finance.yahoo.com/quote/CDMO/history?p=CDMO(iii)https://finance.yahoo.com/quote/CDMO/profile?p=CDMO(iv)https://www.investing.com/equities/peregrine-pharmaceuticals-earnings(v)https://www.zacks.com/

The collected indicators will be transferred to an artificial neural network. The neural network will work on the Bayesian regularization algorithm. In the Bayesian structure, the neural network weights are treated as random variables, while the prior distribution of the network weights and the training sample are considered as a Gaussian distribution, so we have equation (1). A Sugeno-type fuzzy inference system will be modeled after training with data. The architecture of the adaptive neurofuzzy system is described in Figure 1.where M is a specific neural network architecture. ω is the vector of network weights. N is the total number of network weights. represents a priori density. is a likelihood function, which represents the probability of data occurrence, provided that the weights are given by . and are the normalization coefficients. According to Bayes’ rule, we get equationwhere is the normalization coefficient and has nothing to do with the weights of the network, which guarantees that the total probability will be equal to 1.

In order to read various data located on the abovementioned web pages, web scraping methods are used to systematize and further analyze the collected indicators. In fact, the information is collected and exported into a more convenient tabular format. In this way, we will be able to automatically receive new or updated data for the successful collection of the training sample. When scraping, we will use the JavaScript libraries for Node.js that control the Chrome browser without a user interface. With the help of this library, it is quite easy to automatically read data from various websites or create so-called web scrapers that simulate user actions.

The developed scrapper collected data from 98 company reports for 2022. The difficulty in collecting data was to determine the exact time of the report release. Companies can announce the report before the market opens or after the market closes. This means that the information published in the report after the close of trading begins to affect the stock price only the next day. The structure of this data is described in Figure 2.

4. Results and Discussion

4.1. Correlation of Qualitative Indicators of Financial Statements

2022 was a year of turmoil for the global economy. The complexity for the economy is caused by the following events: the bursting of the bubble of technology companies that have grown during the period of COVID-19; military conflict in Ukraine; tightening sanctions against Russia; rise in price of energy resources; and large rise in inflation. Conducting a study on price forecasting in an economically unstable period will make it possible to build a more flexible forecasting model that will be will be resistant to the formation of new economic crises. It should also be noted that we avoid the strategy of keeping stocks on hand for a long time. This research is conducted under the investment strategy “bought today, sold today”. A change in the text in Figure 2 to green color means that the report’s indicators exceeded the expectations of experts and red color means that the indicators did not reach the expectations of the market. The yellow colored columns indicate the percentage change of the previous columns. These qualitative indicators will be used in the analysis of the correlation to the price. The green columns are the purchase prices and the minimum selling price. Orange columns are qualitative indicators of probable profit on the trading day. All qualitative indicators were sorted and are shown in Table 1, and the correlation between them is displayed in Table 2. Correlation analysis consists of determining the degree of relationship between two random variables X and Y. The correlation coefficient is used as a measure of the closeness of such a relationship. The correlation coefficient is estimated from a sample size of n related pairs of observations (x, y) from the joint population of X and Y. A linear correlation coefficient in r is used to estimate the degree of relationship between the quantities X and Y measured in quantitative scales, assuming that samples X and Y are distributed according to the normal law.

The correlation results show that the daily maximum income, when buying a stock at the opening price, strongly depends on EPS and revenue, i.e., the main indicators of the financial report. It has also been observed that the dynamic price indicator is positively correlated in a weaker form with the closing price and the distance from the yearly high price. It was also discovered that last year the opening price of trading and the closing price of trading were strongly correlated with each other. These indicators are explained by the fact that 2021 was a good year in economic terms for many companies; many companies were on the rise after the lifting of restrictive measures in connection with COVID-19. It also indicates the absence of strong volatility during one trading day in 2021. And in 2022, companies are increasingly experiencing both a sharp rise in price and a sharp drop in price during one trading day. It has also been discovered that to a small extent, the closing price is negatively correlated with the yearly high price. That is, it can be concluded that if a company has submitted a bad report but the current price of the company is far from the price of a yearly maximum, then the drop will be minimal, and vice versa, if a company has submitted a bad report and is close to the price of a yearly maximum, then the drop will be maximum. Also, if the company has submitted a good report but the current price of the company is far from the price of a yearly maximum, then the growth will be maximum, and vice versa, if the company has submitted a good report and is close to the price of a yearly maximum, then the growth will be minimal. It is impossible to take into account correlation calculations with price dynamics indicators for the past time since the indicator of a yearly maximum in the past was different.

4.2. Price Prediction Experiment for Building Trading Strategy

Based on the correlation results, we chose DynamicPrice as the output parameter and DynamicEPS, DynamicRevenue, and СlosenessW52 as the input parameters. We will not use DynamicPriceClose as an output parameter because it correlates worse with other variables than DynamicPrice. We will also bring these parameters to normalization in the range from −1 to +1. A fragment of normalization data is shown in Table 3.

Based on these data, we will train the neural network. 70% of the sample data will be used for training, 15% for validation, and 15% for testing. The neural network will consist of 50 layers and work on the Bayesian regularization algorithm. Traditional neural network models consider mean squared errors (MSEs) as a learning objective function and do not optimize the network weights.

The training results are described in Figure 3. Also, Figure 4 presents a series of graphs of training states, i.e., gradient, Mu, number of parameters, sum of squares of parameters, and validation check depending on epochs. Figure 5 shows the graph for obtaining the best training performance indicator = 0.000609 received at epoch 983. In Figure 6 is described the error histogram chart and indicators of training regression R = 0.68741, testing regression R = 0.16651, and general regression R = 0.54211.

After testing, the regression became equal to R = 0.51539. The test schedules are described in Figures 7 and 8.

The received error result of neural network is generally good and is equal to 0.000609. But to develop an algorithm for buying and selling securities, we will need an expert system. To solve this problem, we have modeled an adaptive neurofuzzy inference system. The training took place in 100 epochs, and the minimum RMSE was 0.0260256. Figure 9 shows that RMSE stopped changing after the 7th epoch. The model of ANFIS production fuzzy rules is presented in Figure 10. In Figure 11 are shown the surfaces of 3 input and 1 output variables.

Since we previously said that we avoid the strategy of holding stocks for a long time, our algorithm will be based on making a profit during one trading day. Although studies [11, 15, 19] indicate that good profits can be expected during the month, the indicators of a good financial report set the general trend in the dynamics of the company’s stock price chart. However, we still want to maximize the annual income of the portfolio by increasing trading operations with a small amount of invested money. This study is conducted under the investment strategy “bought today, sold today”. An analysis of the indicators of the goals and the ANFIS model is described in Table 4. When expecting income above 5%, data in excess of 5% of the output variable of the ANFIS model can be used.

Using ANFIS indicators, we make an assumption that if the output indicator is greater than 5%, then when buying a stock at the opening price, there is a good chance of getting 5% of the income when selling during the trading day. Next, we described the algorithm of the buying and selling process. Figure 12 and Table 5 describe a number of experiments that we conducted to prove our hypothesis. Tickers of companies that submitted financial reports were taken in the order of the calendar list of reports from the finance Yahoo website. The data were collected until ANFIS received 4 values greater than 5%.//function to get first trading price.fixOpenPrice()makeBuyOrder(ticker, openPrice, quantity)timerUntil 11.30 a.m. ET.checkStatus(buyOrder) if (buyOrder = complete)  then makeSellOrder(ticker, openPrice∗5%, quantity)  timerUntil 15.30 p.m. ET.   checkStatus(sellOrder)if (sellOrder = complete) Quit.else deleteSellOrder(Id) makeSellOrder(ticker, openPrice, quantity)//after 11.30 a.m. ET.deleteBuyOrder(Id)

A comparison of the training results of the proposed model with other works is described in Table 6 and Figure 13.

The proposed model has the lowest MSE and RMSE values, indicating better performance compared to the other models. Some models using optimization algorithms such as particle swarm optimization, firefly algorithm, genetic algorithm, and cat swarm optimization show competitive results. Traditional convolutional neural networks without denoising exhibit higher MSE and RMSE compared to other models. The table provides a comprehensive overview of model performances, with the proposed model standing out as particularly effective in minimizing prediction errors.

The proposed model stands out with the lowest MSE and RMSE values, indicating superior performance compared to the other models listed. GA-LSTM, DE-FLANN, GA-RBF, CSO-ARMA, PSO-ELMAN, PSO-MLP, BBO-MLP, and CSO-MLP also exhibit low MSE and RMSE values, suggesting competitive performance. SVM and BPNN show higher MSE and RMSE values, indicating comparatively less accurate predictions in the context of the given application. The provided figure offers a clear overview of the comparative performance of the proposed ANFIS model against various other models, highlighting its superior effectiveness in reducing prediction errors.

5. Discussion of Experimental Results

To support article’s hypothesis, the study conducted several experiments, as illustrated in Figure 12 and described in Table 5. These experiments aimed to validate the assumption by analyzing the performance of different tickers of companies that submitted financial reports. The results of the study generally show that our main hypothesis about how earning reports affect investor sentiment is correct; it has been proven that there is a correlation between the data of the financial report and the behavior of the stock price. Research also proves that it is possible to predict the price movement based on data from earning reports for monetary gain. The study showed that the most important parameters in the reports are the dynamics of EPS, revenue, and MaxW52 with analysts’ expectations. The main novelty lies in the fact that we used additional indicators, while researchers usually use only time series of price dynamics to train neural networks. It was also discovered that last year, the opening price of trading and the closing price of trading were strongly correlated with each other. These indicators are explained by the fact that 2021 was a good year in economic terms for many companies; many companies were on the rise after the lifting of restrictive measures in connection with COVID-19. It also indicates the absence of strong volatility during one trading day in 2021. And in 2022, companies are increasingly experiencing both a sharp rise in price and a sharp drop in price during one trading day. It has also been discovered that to a small extent, the closing price is negatively correlated with the yearly high price. That is, it can be concluded that if a company has submitted a bad report but the current price of the company is far from the price of a yearly maximum, then the drop will be minimal, and vice versa, if a company has submitted a bad report and is close to the price of a yearly maximum, then drop will be maximum. Moreover, if the company has submitted a good report but the current price of the company is far from the price of the yearly high price, then the growth will be maximum, and vice versa, if the company has submitted a good report and is close to the price of the yearly high price, then the growth will be minimal. The discovery for us was that the biotechnology company Veru Inc. (VERU) on December 5, 2022, submitted a very abnormally negative report and most likely, the price should have collapsed at the opening of trading. It should be taken into account that MaxW52 was an abnormal 346.3636364%. ANFIS predicted growth at 8%, but in reality, this company grew at +20% during the trading day and by the end of the day, closed trading at +7.7% even with negative report data. After analyzing the sample on which the neural network was trained, we came to the conclusion that abnormal large negative indicators have a good effect on the emotional reaction of traders. It is also possible to build a hypothesis for further research that the report indicators are processed by computational algorithms, perhaps even AI, so it is necessary to investigate the emergence of the concept of emotionality of trading robots that are managed by investment funds. Having understood the behavior of algorithms, it becomes possible to get ahead of them and sell the purchased securities to these algorithms. Also, the results of the study include a scrapper, a sample of financial indicators, a neural network model, ANFIS, and an algorithm for buying and selling. In future work, we plan to compare our approach with other well-known machine learning models used in the evaluation of income statements.

6. Conclusions

First, an adaptive neurofuzzy system architecture was designed, providing a robust framework for the integration of neurofuzzy techniques and adaptive learning algorithms. This architecture facilitated the creation of a powerful tool for processing and analyzing financial data. In addition, the study focused on correlating qualitative indicators within financial statements to filter training data, ensuring that only relevant financial indicators were considered during the learning process. By identifying and prioritizing these qualitative indicators, the neurofuzzy system was optimized to generate more accurate and reliable interpretations of corporate earnings reports. Furthermore, a predictive experiment was conducted to validate the results of the study and to establish a solid foundation for the development of financial strategies. This experiment involved applying the trained neurofuzzy system to predict future financial trends and outcomes based on historical data. By successfully confirming the accuracy and effectiveness of the system through this experiment, it was demonstrated that the neurofuzzy system has the potential to aid decision-making processes and inform financial strategies in the corporate domain. Developed system showcases its ability to interpret complex corporate earnings reports, offering valuable insights to financial professionals and decision-makers.

Data Availability

The data used in this study, including company earnings reports and relevant financial data, were sourced from publicly available sources and databases. These sources primarily include reputable financial news websites (https://finance.yahoo.com/and https://www.investing.com/), official company websites, and regulatory filings such as 10-K and 10-Q reports filed with the U.S. Securities and Exchange Commission (SEC). The dataset is publicly accessible and does not contain any confidential or proprietary information. Researchers interested in accessing the data utilized in this study can refer to the aforementioned publicly available sources and databases. Specific details on the datasets used, including the names of companies, publication dates, and any unique identifiers associated with the data, are provided in the Methods section of this article for clarity and reproducibility. The developed neurofuzzy system’s architecture, methodology, and implementation details are also described comprehensively to facilitate replication and verification of the results. For any inquiries regarding the data used in this study or further clarification on its availability, interested parties may contact the corresponding author.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This research was funded by the Science Committee of the Ministry of Education and Science of the Republic of Kazakhstan (Grant no. AP14869848).