English  |  正體中文  |  简体中文  |  全文筆數/總筆數 : 2872/3785 (76%)
造訪人次 : 3440760      線上人數 : 519
RC Version 6.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜尋範圍 查詢小技巧:
  • 您可在西文檢索詞彙前後加上"雙引號",以獲取較精準的檢索結果
  • 若欲以作者姓名搜尋,建議至進階搜尋限定作者欄位,可獲得較完整資料
  • 進階搜尋
    主頁登入上傳說明關於MUIR管理 到手機版


    請使用永久網址來引用或連結此文件: http://ir.meiho.edu.tw/ir/handle/987654321/3803


    題名: A Hybrid Approch for Sentiment Analysis on Textual Big Data – On the case of Financial Document Text Mining to Predict Stock-Index Trend
    作者: Chiang, Johannes K
    Huang, Grant
    Yeh, Ron Chuen
    貢獻者: 經營管理學院
    關鍵詞: Sentiment analysis
    Text Mining
    Latent Dirichlet Allocation (LDA)
    Supporting Vector Machine (SVM)
    NPMI-Concor
    日期: 2019-12
    上傳時間: 2020-01-03T03:22:39Z (UTC)
    摘要: Recently, there is an increasing interest in sentiment analysis and its applications in various fields. The purpose of this research is to analyze the potential emotion by means of sentiment analysis into large volume of financial texts to predict the stock-index trend. In the recent researches into sentiment analysis, supervised methods were proven to be able to reach the high accuracy of classification, but the input data of the methods should be pre-processed manually and they couldn’t discover the unknown classes. This research put forward a hybrid solution which combines supervised and unsupervised methods. Firstly, we introduce the unsupervised methods to find out the themes, viz. the topics within the documents. Then, we calculate the sentiment index to judge the document’s emotional direction. After that, we figured out which topics within the documents’ sentiment index are leading indicators in Taiwan’s electronic sub-index (TE). Finally, we used supervised methods by combining the sentiment index of leading indicators with the other 24 indirect sentiment indexes to build the classification model of TE. As a result, we found that Latent Dirichlet Allocation (LDA) method has a better clustering performance than that of TFIDF-Kmeans, and also outcomes a higher accuracy for clustering topics than the NPMI-Concor. By comparing sentiment index with MACD, i.e. a technical index that most investors of in the stock market use, we proved that the coincidence of sentiment index and TE is higher than that of MACD line and TE. We also found that the sentiment indexes from topics of business management and macro-economics are leading indicators and the prediction model of TE which comprises the sentiment index is better than that only comprises the technical indicators. At last, we employed supervised methods viz. kNN, Naïve Bayes, SVM, and Logistic regression, to build the classification models and found that SVM has the best performance with respect to precision, recall and f-measure. The study is the first of its kind thus far.
    顯示於類別:[企業管理系] 期刊論文

    文件中的檔案:

    沒有與此文件相關的檔案.



    在MUIR中所有的資料項目都受到原著作權保護.


    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 回饋