Έλεγχος της οδικής κυκλοφορίας με χρήση αλγορίθμων
ενισχυτικής μάθησης (Reinforcement Learning)

Αλκίδης, Ηλίας; Alkidis, Ilias

dc.contributor.author	Αλκίδης, Ηλίας	el
dc.contributor.author	Alkidis, Ilias	en
dc.date.accessioned	2015-01-23T08:17:48Z
dc.date.available	2015-01-23T08:17:48Z
dc.date.issued	2015-01-23
dc.identifier.uri	https://dspace.lib.ntua.gr/xmlui/handle/123456789/40093
dc.identifier.uri	http://dx.doi.org/10.26240/heal.ntua.7588
dc.rights	Default License
dc.subject	Ενισχυτική	el
dc.subject	Μάθηση	el
dc.subject	Προσομοίωση	el
dc.subject	Οδική κυκλοφορία	el
dc.subject	Reinforcement	en
dc.subject	Simulation	en
dc.subject	Learning	en
dc.subject	Traffic	en
dc.subject	Control	en
dc.title	Έλεγχος της οδικής κυκλοφορίας με χρήση αλγορίθμων ενισχυτικής μάθησης (Reinforcement Learning)	el
heal.type	bachelorThesis
heal.classification	Ευγυή Συστήματα	el
heal.classificationURI	http://localhost:8080/healp/data/3/9
heal.language	el
heal.access	free
heal.recordProvider	ntua	el
heal.publicationDate	2014-10-30
heal.abstract	Το κυκλοφοριακό ζήτημα είναι μείζονος σημασίας στις σύγχρονες αστικές κοινωνίες. Η δραματική αύξηση του αριθμού των ανθρώπων και των οχημάτων τις τελευταίες δεκαετίες στα αστικά κέντρα έχουν οδηγήσει σε καθημερινά φαινόμενα κυκλοφοριακής συμφόρησης. Στην παρούσα εργασία πραγματοποιήθηκε μια εκτενής μελέτη του προβλήματος της διαχείρισης της οδικής κυκλοφορίας και των μεθόδων που εφαρμόζονται για την επίλυσή του. Χρησιμοποιήθηκε ο αλγόριθμος Q-learning της ενισχυτικής μάθησης στην ε-greedy παραλλαγή του με σκοπό να εκπαιδευτεί κατάλληλα ένας πράκτορας και να ελέγξει μια διασταύρωση τεσσάρων κατευθύνσεων. Σαν σημείο αναφοράς χρησιμοποιήθηκαν δύο άλλες μέθοδοι ελέγχου της ίδιας διασταύρωσης – ένας αλγόριθμος σταθερού χρονισμού και ένας προσαρμοστικός. Η λύση που προτάθηκε από την εργασία αυτή παρουσίασε σημαντικές βελτιώσεις στην απόδοση σε σχέση και με τις άλλες δύο μεθόδους, δείχνοντας ότι ευφυείς τεχνικές και, ειδικότερα, της ενισχυτικής μάθησης είναι ιδιαίτερα κατάλληλες για την επίλυση προβλημάτων όπως το κυκλοφοριακό.	el
heal.abstract	Traffic control is of great importance in modern urban societies. The dramatic increase of population and vehicles during the last decades has led to daily occurrences of traffic congestion. In this thesis was conducted a thorough research of the issue of traffic control management and of the methods that are deployed for its solution. We used the Qlearning algorithm of reinforcement learning theory in its ε-greedy variation to effectively train an agent in controlling a four way intersection. As a baseline were used two conventional methods of control for the same intersection – a fixed time algorithm and an adaptive one. The solution that was proposed in this thesis has provided substantial improvements in performance in comparison to the other two methods, showing that intelligent techniques and specifically reinforcement learning are very well suited to solve problems such as traffic control.	en
heal.advisorName	Σταφυλοπάτης, Ανδρέας-Γεώργιος	el
heal.committeeMemberName	Κόλλιας, Στέφανος	el
heal.committeeMemberName	Στάμου, Γεώργιος	el
heal.committeeMemberName	Σταφυλοπάτης, Ανδρέας-Γεώργιος	el
heal.academicPublisher	Εθνικό Μετσόβιο Πολυτεχνείο.Σχολή Ηλεκτρολόγων Μηχανικών και Μηχανικών Υπολογιστών. Τομέας Τεχνολογίας Πληροφορικής & Υπολογιστών	el
heal.academicPublisherID	ntua
heal.numberOfPages	85
heal.fullTextAvailability	true