• 沒有找到結果。

CHAPTER 2 Related Work and Background Knowledge

2.5 Investor Sentiment Index

In 1990, DeLong et al. proposed the noise trader model [10]. The model considers that in the market there are some noise traders who give not only excessive response but the low response to information, which directly affects the stock price equilibrium and result to market risk rise. Investor sentiment is one of the irrational factors that cause the transaction of the noise. It represents the subjective judgment of the investor based on his attitude to the future stock price rising or falling. Hence, many studies have begun to look for emotional factors that can reflect the future changes in the stock market for traders. Schmeling used a new data set on investor sentiment to show that institutional and individual sentiment proxy for smart money

and noise trader risk.

In 2001, Brown et al. indicated that if investors believe that the actual value of the stock is higher than (below) the current price, their mentality will tend to be optimistic (pessimistic) [2]. The authors also found that investor sentiment will be affected by past stock returns and the degree of emotional volatility is also highly correlated with the stock returns in the same period. Schmeling used a new data set on investor sentiment to show that institutional and individual sentiment proxy for smart money and noise trader risk [36]. Lee et al. found that there is a close relationship between investor sentiment, market volatility and excess return [27].

The following describes the research related to investor sentiment indices. In 1998, Neal et al. researched the three investor sentiment indices including the degree of closed-end funds discount, odd-lot ratio, and net redemptions for forecasting stock returns [31]. They found that the degree of closed-end funds discount and net redemption are predictable for small companies, and the odd-lot ratio has no predictability for either large or small company. In 2000, Barber et al. considered that there is a strong negative relationship between the IPO rate and the following year's market returns [6]. This relationship can provide a stronger market return prediction. In 2012, Wang et al. pointed out that there is a high correlation between the three retail sentiment variables including the buy-sell imbalance, the individual investor turnover and the proportion of day-trades [41]. This result indicated that these three variables are likely to capture similar retail investment behaviors.

CHAPTER 3

Proposed Approach

In this section, we will introduce our proposed approach, which uses two investor sentiment indices in the grouping genetic algorithm to obtain a diverse group stock portfolio. We first design some rules based on two sentiment indices to decide the buying and selling points of all the input stocks. These stock price data with buying and selling points are then sent to the proposed based approach for obtaining a best group stock portfolio. The proposed GGA-based approach will use the sentiment indices and the other factors in its fitness to evaluate the chromosomes (possible group stock portfolios). After a predefined number of generations, the chromosome with the best fitness value at the last generation is output as the diverse group stock portfolio. The flowchart of the proposed approach is given in Fig. 1.

Fig. 1. The flowchart of the proposed approach.

Fig. 1 shows that the input data is preprocessed by trading strategy generated from investor sentiment index and trading signal. And, it puts the preprocess data into the grouping genetic algorithm to produce the best derived diverse GSPs with investor sentiment index. The components of the proposed approach are described as follows.

3.1 Data Preprocessing: Trading Strategy

For considering the sentiment factor in the proposed GGA-base approach for finding a good group stock portfolio, we first transform the input stock price sequences to the ones with sentiment considered. We design some rules based on two sentiment indices to decide the buying and selling points of all the input stocks.

Most of the portfolio research topics focus on how to find a high-risk and low-risk portfolio; however, little research is proposed to provide users with trading time points of the stock portfolio. In this study, we provide users with trading timings for each trading period. For instance, we assign the trading period from 2014 to 2016. With this research, we accurately informed consumers to buy the stock on February 3, 2014, and sell it on December 24, 2016.

What is more, this study still retains the advantages of high-risk and low-risk of traditional stock portfolios.

In this study, the decision rules for determining the timing of buying or selling the stocks are based on the investor sentiment index. In the previous chapter, there are many indices that can reflect the investor's sentiment, called investor sentiment index. After filtering, we choose the research [41] to provide investor sentiment indicators as the basis for this study. There are three reasons for selecting the investor sentiment indices. First, the sentiment indices provided by the paper apply to the Taiwan stock market and it consists with the data used in this paper.

Due to the different rules of the transaction, the floating price of the stock in different regions

will be different, and the impact of investor sentiment indices will also be affected. Therefore, selecting the same range of research as our paper is important. Second, the formulas for investor sentiment indices are clear and easy to understand. Too complicated formulas not only difficult to directly express its meaning but also cause the overlong operation time. Third, the indices are designed based on the behavior of retail investors. In Taiwan, retail investors' trading volume accounts for 70% of Taiwan's overall trading volume for the stock market. Therefore, studying Taiwan's retail investors' behavior is more meaningful and persuasive. Next, we introduce the two investor sentiment indices used in this paper to reflect the investor's sentiment.

The first index is called Buy-Sell Imbalance (BSIt), which is defined as follows:

,

The parameters in this formula are as follows: BMLt is the balance of margin loan of the stock on the day t; BSLt is the balance stock loan of the stock on the day t; Vt is the trading volume of the stock on the day t; BIt is the buying volume of institutional investors of the stock on the day t; and SIt is the selling volume of institutional investors of the stock on the day t.

With these defined parameters, BSIt means the value of buy-sell imbalance of the stock on the day t, VBt means the retail investors' buying volume of the stock on the day t, and VSt means the retail investors' selling volume of the stock on the day t.

Then we explain the proper terms used. The margin loan is an investor’s behavior which means investor borrows money from securities dealer to buy the stock. In Taiwan, the securities

dealer will borrow 60% of the value of the stock to investors with good credit to purchase the stock. For example, we assign that the price of the stock is one hundred, and the value of the stock is one hundred thousand. It means that investors can only use forty thousand to hold the stock by margin loan. In general, investors expect that stock price will rise in the future, but the hands of the funds is not enough; then they pay a part of the margin to securities dealer to borrow money for buying stock, and then wait for an opportunity to sell the stock at high prices and earn low selling low spreads. In contrast to a margin loan, the stock loan means investors borrow stock from securities dealer to sell it. If investors expect that stock price will fall in the future, they can borrow stock from securities dealer to sell by the stock loan. The trading volume represents the total number of transactions including institutional investors and retail investors, where institutional investors include foreign investment institutions, investment trusts, and dealers. The retail investors represent the individual investor. Retailers are usually composed of low-income individual investors and characterized by less investment per person.

Next is an analysis the formula of BSIt. Above all, we focus on the fractions. On a stock at the time t, the higher of the BMLt means the higher volume of investor borrowing money to buy the stock. This phenomenon can be interpreted as the degree of investors expecting the stock to turn high is increasing. On the other hand, the higher the BSLt means the higher volume of investor borrowing stock to sell the stock. Therefore, if the difference between BMIt and BSLt is high, it means the degree of investors expecting the stock to rise is high too; in other words, investors are optimistic about the stock. If the difference between BMIt and BSLt is low, it means investors are pessimistic about the stock. Then, we focus on the denominator. The sum of VBt and VSt means the overall retail investors' trading volume of the stock at the time t. To sum up, at time t, higher BSIt value means that the retail investors are more optimistic about the stock.

The second investor sentiment index is called proportion of Day-Trades (DTt), which is

defined as follows:

,

t t t Shares

DTNDT (4)

where NDTt is the number of day-trades and Sharest is the outstanding shares. In the stock market, the number of day-trades (NDTt) means investors’ twice trading behaviors (including buy after sell or sell after buy) by margin loan and stock loan on the same day t on the same stock. For example, if a man buys (sells) a share of TSMC on December 12, 2016 by margin (stock) loan and sells (buys) it by stock (margin) loan on the same day, the number of day-trades of TSMC on December 12 adds one. The Sharest means the amount of stock that the company offers to investors on the day t. Generally, the magnitude of the Sharest change is extremely small. Only when the company makes a big change, the Sharest will be different.

Therefore, the Sharest is used to normalize NDTt.

Next is an analysis of the formula of DTt. When the DTt of the stock turns high, it means more and more investors day-trade the stock on the same day t. Generally, most of the investors do not day-trade, only those who are particularly interested in the stock will do so. Thus it can be seen that there is a positive relation between DTt and the degree of investors’ interest in the stock. However, the value of DTt does not represent the investor expecting the stock price to rise or fall. For instance, if investors are expecting the stock price to rise, they may buy a stock by margin loan and sell it by stock loan on the same day, and that causes the DTt of the stock to rise. Then, if investors are expecting the stock price to fall, they may sell a stock by stock loan and buy it by margin loan on the same day, and this behavior still causes the DTt of the stock to turn high.

With these two investor sentiment indices, we divide the investor sentiments on a stock into six cases as follows:

Table 1. Six cases for the two investor sentiment indices.

In Table 1, the investor sentiments are classified into six cases by the status of BSIt and the change of DTt. We use the six cases in Table 1 to judge the stocks on every trading day. For example, Case II reflects the active attitude of the investor on a stock. However, it does not give the rising or falling expectation of investors. Cases IV, V and IV reflect the investors holding the passive attitude to a stock. However, the passive attitude doesn’t affect the stock price. Therefore, Cases II, IV, V and VI can be ignored. On the contrary, the active attitude of investors to a stock will affect the stock price. Thus, we use Cases I and III to develop the

For distinguishing the status of BSIt, the BSI thresholds are defined. The BSI thresholds include Low BSI threshold and High BSI threshold as follows:

Low BSI threshold = PR10(BSIt),

High BSI threshold = PR90(BSIt),

where PR10(BSIt) is the value which is at the PR10 from all 𝐵𝑆𝐼t value, and PR90(BSIt) is the value which is at the PR90 from all 𝐵𝑆𝐼t value. The PR (percentile rank) value is a percentage of scores in its frequency distribution that are equal to or lower than it. It is mostly used in the examination results above the rankings. For example, a score that is greater than or equal to

50% of the scores of people is said to be at the 50th percentile, where 50 is the percentile rank.

To practice the PR value, two steps should be implemented. They are assigned a string of numbers S as follows:

S = {12, 8, 2, 1, 17, 15, 4, 18, 9, 3, 11, 16, 10, 19, 5, 20, 14, 7, 13, 6}.

First, the string S is sorted from small to large. Then the new string S' is as follows:

S' = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20}.

Then, we use the formula to calculate the PR value of every element in the string S'. The formula is as follows:



 

N

PRvaluei 100Ri 50

100 , (5)

where Ri is the rank of element i, and N is the number of elements in the string S. Finally, we get the PR value of every element in the string S. In this case, N is 20. The PR value of the element ‘2’ is 7, called PR7(S), because the rank of ‘2’ is 19. Similarly, proving that ‘3’ is PR12(S), ‘4’ is PR17(S), …, ‘20’ is PR97(S). PR10(S) cannot be calculated due to the small number of elements in the string S. Therefore, we choose the element which is larger than PR10(S) and the closest one to PR10(S) to be the Low BSI threshold. The element ‘3’ is the Low BSI threshold. Similarly, we choose the element ‘19’ to be the High BSI threshold. With the Low BSI threshold and the High BSI threshold, it is defined that the High BSIt is the BSIt

which islarger than High BSI threshold, and the Low BSIt is the BSIt which is smaller than Low BSI threshold.

Following that, we introduce the trading strategy in this proposed approach. Before defining the trading strategy, we combine Case I and Case III with buying and selling 46 company shares for finding the rules which maximize the stock return. The combination1 is, if Case I is true, buy the stock and if Case III is true, sell it. The combination2 is, if Case III is true, buy the stock and if Case I is true, sell it. Compare combination1 with combination2, find combination2, and make the overall stock more profitable than combination1. The reason for this phenomenon is likely to be that the investor sentiment experienced the lowest point is likely to rise and then drive the growth of stock prices. After the consolidation of the above information, the trading strategy is set up of this study. The trading strategy is composed of two rules as follows:

Rule 1: On a day t, if a stock’s BSIt is Low and DTt is rising, then buy the stock.

Rule 2: On a day t, if a stock’s BSIt is High and DTt is rising, then sell it.

Using the defined rules, the trading points are added to the original data. In this study, we define that the number of transaction on every stock is one. Therefore, we choose the first buying point and the last selling point to be the trading timing on this stock. By following this investor sentiment method, we get the Buy price and Sell price on every stock in the input data.

3.2 Components of the Proposed Approach

3.2.1 Encoding Scheme

We use a set S to represent n stocks, and denote them as {s1, s2, …, sn}. The number of the groups is K. Then, the encoding scheme of the chromosome is defined as follows in Fig. 2.

Fig. 2. The encoding scheme of chromosome.

In Fig. 2, the chromosome consists of three parts including the grouping part, stock part, and stock portfolio. The number of groups in the grouping part is K. The results of the stock grouping are expressed in stock part. The sab means the stock is the a-th element in the group b, where G1G2…Gk = {s11, s21, ..., sh1} {s12, s22, ..., sm2}  ...  {s1k, s2k, ..., sfk} = S, Gi

  and i  j, Gi  Gj = . Only one stock can be selected to compose the stock portfolio.

Therefore, it means that only K stocks may be selected from groups into the stock portfolio. In the stock portfolio, we use two genes to associate to each group. The two genes are bi and ui, where bi is the threshold for deciding whether or not to buy the stock selected in the group Gi

and ui is the number of units purchased. If bi is greater than or equal to 0.5, it means a stock in group Gi will be purchased ui units as part of a stock portfolio. Next, we use an example to explain the encoding scheme.

Example 1: There are ten stocks s1, s2, s3, ..., s10 and the number of group K is 3. The chromosome is encoded as shown in Fig. 3:

Fig. 3. An example of encoding scheme.

In Fig. 3, ten stocks {s1, s2, s3, ..., s10} are divided into three groups G1,G2 and G3. The allocation of stocks is {s1, s3, s9}for G1, {s4, s8, s10}for G2 and {s2, s5, s6, s7}for G3. Because the values of b1 and b3 are larger than 0.5, it means that two stocks will be selected from G1

and G3 to compose a stock portfolio. To be more precise, one stock will be selected from {s1, s3, s9} with two purchased units, and the other one will be selected from {s2, s5, s6, s7} with four purchased units. In this example, it means that there are twelve (= 3*4) stock portfolios generated by this chromosome.

3.2.2 Initial Population

The strategy of generating initial population is important, because it reflects on the final optimization result. You et al. pointed out that a portfolio with cash dividend yield is better than a portfolio with types in the Taiwan stock market [39]. Therefore, choose cash dividend yields of stocks to frame the initial population in this proposed approach. Next, we use an example to explain this method for explaining the benefits of using cash dividend yield to generate initial population shown in Table 2. In Table 2, it shows cash dividends, stock price,

and cash dividend yield of two companies named Hon Hai Precision Industry Co., Ltd. (2317) and Formosa Petrochemical Corporation (6505).

Table 2. Cash dividend yields of two companies.

2011 2012 2013 2014 2015

Cash dividends of

2317 (per share) 1.5 1.5 1.8 3.8 4

Stock price of 2317 82.90 88.90 80.10 87.90 80.80 Cash dividend

yield of 2317 1.81% 1.69% 2.25% 4.32% 4.95%

Cash dividends of

6505 (per share) 2 0.26 2.5 0.85 4

Stock price of 6505 93.80 86.00 81.80 68.70 78.80 Cash dividend

yield of 6505 2.13% 0.30% 3.06% 1.24% 5.08%

In Table 2, the cash dividends of 2317 are 1.5, 1.5, 1.8, 3.8 and 4, the stock prices of 2317 are 82.90, 88.90, 80.10, 87.90 and 80.80. Then, we calculate the cash dividend yields defined as that cash dividend divided by stock price. Therefore, the cash dividend yields of 2317 are 1.81%, 1.69%, 2.25%, 4.32% and 4.95%. In the same way, the cash dividend yields of 6505 are 2.13%, 0.30%, 3.06%, 1.24% and 5.08%. To compare the stability of 2317 and 6505, the standard deviation of each cash dividend yields is calculated. The standard deviation of 2317 is 1.35938, and the standard deviation of 6505 is 1.63947. As a result, we know 2317 is better than 6505 because the cash dividend yield of 2317 is stable. It means that buying 2317 can earn a stable income at low risk. Then, the cash dividend yields (yi) of n companies are shown in Table 3.

Table 3. The cash dividend yield (yi) of companies.

s1 s2 si sn-1 sn

y1 y2 yi yn-1 yn

With the cash dividend yield (yi) of n companies, we use some existing techniques like

With the cash dividend yield (yi) of n companies, we use some existing techniques like

相關文件