Intent Mining of Thai Phone Call Text Using a Stacking Ensemble Classifier with GPT-3 Embeddings


Nattapong Sanchan

    Abstract:

    Intent mining has recently attracted Natural Language Processing (NLP) research communities. Despite the extensive research on English and other widely spoken languages, intent mining in Thai remains unexplored. This paper proposes an extended framework for mining intentions in Thai phone call text. It utilized a stacking ensemble method with GPT-3 embeddings, constructed by systematically determining based and meta-classifiers using Q-statistic and F1 scores. Overall, the based classifiers consisting of Support Vector Classier (SVC), k-nearest Neighbors (KNN), and Random Forest (RF) were derived with a meta-classier, Logistic Regression (LR). We compared the mining results, derived through the proposed Stacking Ensemble Classier (SEC), to 1) the individual base classifiers and 2) the three BERT baselines: BERT Multilingual Uncased, and BERT-th, and BERT Based EN-TH Cased. The results revealed that SEC could outperform SVC, KNN, RF, BERT Multilingual Uncased, and BERT-th, except BERT Based EN-TH Cased. However, a statistical analysis conducted using Friedman and Holm's post hoc tests reported no statistically significant difference between SEC and BERT Based EN-TH Cased, inferring that the two classifiers perform similarly practically.

    Keywords: Intent Mining, Intention Mining, Intent Classification, Intent Detection, Text Mining

    References:

    Download this paper: [PDF]
    Dowload bibtex entry: [BibTex]
    @article{2025_Sanchan, 
                        title={Intent Mining of Thai Phone Call Text Using a Stacking Ensemble Classifier with GPT-3 Embeddings}, 
                        volume={19}, 
                        url={https://ph01.tci-thaijo.org/index.php/ecticit/article/view/258239}, 
                        DOI={10.37936/ecti-cit.2025191.258239}, 
                        number={1}, 
                        journal={ECTI Transactions on Computer and Information Technology (ECTI-CIT)}, 
                        author={Sanchan, Nattapong}, 
                        year={2025}, 
                        month={Jan.}, 
                        pages={135–145}
          }
    Rich text bibliography entry (for copy & paste into a word processor):
    Sanchan, N. (2025). Intent Mining of Thai Phone Call Text Using a Stacking Ensemble Classifier with GPT-3 Embeddings. ECTI Transactions on Computer and Information Technology (ECTI-CIT), 19(1), 135–145. https://doi.org/10.37936/ecti-cit.2025191.258239