A PaLM-Embedding and SLIM-Driven Model for High-Precision Tourist Behaviour and Demand Prediction

Authors

  • Pulime Satyanarayana Author
  • Pedagadi Gangadhar Tilak Author
  • Addla Navaneeth Reddy Author
  • Chanda Ajay Author

DOI:

https://doi.org/10.64751/ijdim.2026.v5.n2(2).789

Keywords:

Tourism management, Natural Language Processing (NLP), High-Dimensionality Manifold, Linear Discriminant Analysis (LDA), Quadratic Discriminant Analysis (QDA), Histogram Gradient Boosting (HGB), SMOTE (Synthetic Minority Over-sampling Technique).

Abstract

The rapid digitization of the travel industry has generated vast repositories of unstructured customer reviews, yet extracting actionable intelligence for demand forecasting remains a complex challenge. Historically, tourism management relied on basic statistical models and time-series analysis to predict visitor influx. However, these traditional systems frequently fail to account for the nuanced sentiment and Behavioral shifts reflected in modern digital footprints. The core problem lies in the high dimensionality of natural language and the inherent class imbalance found in tourism datasets, where certain customer segments or extreme ratings are underrepresented. Conventional machine learning approaches, such as Linear Discriminant Analysis (LDA) and Quadratic Discriminant Analysis (QDA), often lack the architectural depth to process semantic context or handle skewed data distributions effectively, leading to suboptimal prediction accuracy. To address these limitations, there is a critical need for a framework that fuses deep linguistic understanding with robust statistical rebalancing. This research proposes GPS-Tourism, a hybrid high-precision model integrating Google Pathways Language Model (PaLM) embeddings, the Synthetic Minority Oversampling Technique (SMOTE), and a Sparse Linear Integer Model (SLIM) Classifier. In the proposed system, PaLM converts raw reviews into dense 768-dimensional semantic vectors, while SMOTE synthetically balances the training manifold to ensure minority behavioral patterns are not ignored. Finally, the SLIM architecture is implemented as an ensemble of oblique trees that executes the final classification of tourist ratings and demand segments. Experimental results demonstrate that this fusion significantly outperforms QDA, LDA, and Histogram Gradient Boosting (HGB) models. The significance of this research lies in its ability to provide tourism stakeholders with a granular, high-precision tool for resource allocation and personalized marketing, ultimately bridging the gap between qualitative sentiment and quantitative demand forecasting.

Downloads

Published

2026-04-24

How to Cite

Pulime Satyanarayana, Pedagadi Gangadhar Tilak, Addla Navaneeth Reddy, & Chanda Ajay. (2026). A PaLM-Embedding and SLIM-Driven Model for High-Precision Tourist Behaviour and Demand Prediction. International Journal of Data Science and IoT Management System, 5(2(2), 202-212. https://doi.org/10.64751/ijdim.2026.v5.n2(2).789

Similar Articles

201-210 of 614

You may also start an advanced similarity search for this article.