HEAL DSpace

Maritime Accident Prediction with Machine Learning

Αποθετήριο DSpace/Manakin

Εμφάνιση απλής εγγραφής

dc.contributor.author Σπαθιάς, Γεώργιος el
dc.contributor.author Spathias, Georgios en
dc.date.accessioned 2023-01-26T09:13:50Z
dc.date.available 2023-01-26T09:13:50Z
dc.identifier.uri https://dspace.lib.ntua.gr/xmlui/handle/123456789/56923
dc.identifier.uri http://dx.doi.org/10.26240/heal.ntua.24621
dc.rights Default License
dc.subject Machine Learning en
dc.subject Supervised Learning el
dc.subject Classification algorithms el
dc.subject Maritime accidents prediction el
dc.subject Python el
dc.subject Μηχανική Μάθηση el
dc.subject Πρόβλεψη ναυτικών ατυχημάτων el
dc.subject Ανάλυση δεδομένων el
dc.subject Μοντέλα πρόβλεψης el
dc.subject Τεχνητή Νοημοσύνη el
dc.title Maritime Accident Prediction with Machine Learning en
dc.title Ανάπτυξη Μοντέλου Πρόβλεψης Ναυτικών Ατυχημάτων με τη χρήση Αλγορίθμων Μηχανικής Μάθησης el
dc.contributor.department Division of Ship Design and Maritime Transport el
heal.type bachelorThesis
heal.classification Naval - Research engineering en
heal.language el
heal.language en
heal.access campus
heal.recordProvider ntua el
heal.publicationDate 2022
heal.abstract The present thesis aims to predict information about maritime accidents by using machine learning models. Maritime accidents affect the smooth operation of the maritime industry. Their consequences are important because they lead to human deaths, injuries and environmental disasters. Hence, it is important to reduce the dangers and try to minimize the number of accidents. Accident records from two different databases are collected, cleaned and modified accordingly in order to fit machine learning models. The data offer information about maritime accidents and their outcome. The type of the accidents, their severity and all their consequences are the output features, while, information about the vessels and the circumstances during the accident represent the input features. Data modification and their preparation for the models is one of the most important procedures in this thesis. Databases quality and the existence of data imbalance is one of the main factors that affect the final results. The existence of 2 different databases allows their comparison and helps to understand which one of them is ideal and why. Machine learning is a subfield of Artificial Intelligence. This thesis implements 10 different machine learning models that are commonly used for supervised learning (MLP, Logistic Regression, Gaussian NB, Random Forest, Decision Trees, AdaBoost, Gradient Boosting, Hist Gradient Boosting, XGBoost, LGBM). These models are being used because they are ideal for classification (the output features have categorical classes) and they are also used in similar studies. The models are based on neural networks, statistics, decision trees and boosting. A wide variety of models is being used in order to compare their results and understand which one is ideal for similar projects. The main goal of this thesis is to predict the type of an accident. Given that this feature is encoded to have 4 classes, their weighted f1-score is 52%. This result is the optimal out of all the studied cases. It occurs from the first database, which has more features that the second one, but it contains less accident records. Also, the result occurs from random forest classifier which along with the boosting-based algorithms (Gradient Boosting, Hist Gradient Boosting, XGBoost, LGBM) produce the best results. en
heal.advisorName Βεντίκος, Νικόλαος el
heal.advisorName Ventikos, Nikolaos P.
heal.committeeMemberName Ginnis, Alexandros
heal.committeeMemberName Anyfantis, Konstantinos
heal.academicPublisher Εθνικό Μετσόβιο Πολυτεχνείο. Σχολή Ναυπηγών Μηχανολόγων Μηχανικών el
heal.academicPublisherID ntua
heal.numberOfPages 170 σ. el
heal.fullTextAvailability false


Αρχεία σε αυτό το τεκμήριο

Αυτό το τεκμήριο εμφανίζεται στην ακόλουθη συλλογή(ές)

Εμφάνιση απλής εγγραφής