Forecasting Movie Box Office Profitability

Marta Galvão 1 * , Roberto Henriques 1

J INFORM SYSTEMS ENG, Volume 3, Issue 3, Article No: 22.

OPEN ACCESS   1904 Views   1174 Downloads

Download Full Text (PDF) Cite this article


This study intends to estimate the profit of a movie through the construction of a predictive model that uses several Data Mining techniques, namely neural networks, regression and decision trees. The model will allow obtaining the prediction of box office revenue. Three different dependent variable approaches were used (interval, categorical and binary) aiming to study the difference and predictive influence that each one has on the results. Two metrics were used to determine the most accurate predictions: the misclassification error for the categorical models and the average squared error for the continuous one. In this study, the best predictive results were obtained through the use of multi-layer perceptron. Regarding the representative distinction between the dependent variable, the multiclass model presents a much higher error rate comparing to the remaining ones, which is explained with the increase of the number of classes to predict.


data mining, movie profitability, box office profit, predictive analysis, neural networks, decision tree, regression




Galvão, M., and Henriques, R. (2018). Forecasting Movie Box Office Profitability. Journal of Information Systems Engineering & Management, 3(3), 22.

Submit a Manuscript