With the rapid growth in the number of movies being released, it can be very challenging to decide whether a movie is suitable for family viewing. Almost every country has a movie rating system that determines the age group for which a movie is suitable. However, current rating systems require a professional to watch the full movie. In this paper, we develop a model that determines a movie's rating level using only its subtitles, without any professional intervention. To convert the text data into numerical features, we use the TF-IDF vectorizer, the WIDF vectorizer, and the Glasgow weighting scheme. We evaluate random forest, support vector machine, k-nearest neighbor, and multinomial naive Bayes classifiers to find the combination that achieves the best results. We achieved an accuracy of 85%. The results of our classification approach are promising, and the approach can be used by movie rating committees for pre-evaluation.
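The general idea of weighting subtitle words and then classifying can be illustrated with a minimal, standard-library-only sketch. It pairs a smoothed TF-IDF weighting with a cosine-similarity k-nearest-neighbor vote, which is only one of the combinations named above and is not the authors' implementation; WIDF and the Glasgow weighting scheme are not shown. The subtitle snippets, the labels `family` and `mature`, and all function names are invented for illustration.

```python
import math
from collections import Counter

# Toy training data: invented subtitle snippets with hypothetical labels.
TRAIN = [
    ("let us bake a cake for the birthday party", "family"),
    ("the puppy plays in our garden with the kids", "family"),
    ("we sing a happy song at the school party", "family"),
    ("the killer hid the blood stained knife", "mature"),
    ("he fired the gun and blood covered the floor", "mature"),
    ("the gang planned a brutal murder with a knife", "mature"),
]

def tokenize(text):
    """Lowercase and split on whitespace (assumed preprocessing)."""
    return text.lower().split()

def fit_idf(docs):
    """Smoothed inverse document frequency: log((1 + n) / (1 + df)) + 1."""
    n = len(docs)
    df = Counter()
    for doc in docs:
        df.update(set(tokenize(doc)))
    return {term: math.log((1 + n) / (1 + count)) + 1.0
            for term, count in df.items()}

def tfidf(doc, idf):
    """L2-normalized sparse TF-IDF vector; unseen terms are dropped."""
    tf = Counter(t for t in tokenize(doc) if t in idf)
    vec = {t: c * idf[t] for t, c in tf.items()}
    norm = math.sqrt(sum(w * w for w in vec.values()))
    return {t: w / norm for t, w in vec.items()} if norm else {}

def cosine(a, b):
    """Dot product of two L2-normalized sparse vectors."""
    return sum(w * b.get(t, 0.0) for t, w in a.items())

IDF = fit_idf([text for text, _ in TRAIN])
TRAIN_VECS = [(tfidf(text, IDF), label) for text, label in TRAIN]

def predict(doc, k=3):
    """Majority vote among the k most similar training snippets."""
    query = tfidf(doc, IDF)
    ranked = sorted(TRAIN_VECS, key=lambda p: cosine(query, p[0]), reverse=True)
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]

if __name__ == "__main__":
    print(predict("the kids sing at the birthday party"))
    print(predict("blood on the knife and the gun"))
```

In practice a library such as scikit-learn would replace the hand-written vectorizer and classifier, and full subtitle files with real rating labels would replace the toy snippets.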
Cautionary Note: Some chapters of this paper may contain words that many will find offensive or inappropriate; however, this cannot be avoided owing to the nature of the work.
machine learning, deep learning, natural language processing, NLP, subtitles, movie ratings, parental guidelines
Primary Language | English |
---|---|
Subjects | Engineering |
Section | Design and Technology |
Authors | |
Early View Date | 14 March 2023 |
Publication Date | 25 March 2023 |
Submission Date | 21 July 2022 |
Published in Issue | Year 2023 Volume: 11 Issue: 1 |