Ανίχνευση μεταφορικών γλωσσικών φαινομένων με την χρήση Συνδυασμού Βαθιών Νευρωνικών δικτύων

Ποταμιάς, Ρολάνδος Αλέξανδρος; Potamias, Rolandos Alexandros

dc.contributor.author	Ποταμιάς, Ρολάνδος Αλέξανδρος	el
dc.contributor.author	Potamias, Rolandos Alexandros	en
dc.date.accessioned	2019-03-06T09:49:21Z
dc.date.available	2019-03-06T09:49:21Z
dc.date.issued	2019-03-06
dc.identifier.uri	https://dspace.lib.ntua.gr/xmlui/handle/123456789/48355
dc.identifier.uri	http://dx.doi.org/10.26240/heal.ntua.16523
dc.rights	Αναφορά Δημιουργού-Μη Εμπορική Χρήση-Όχι Παράγωγα Έργα 3.0 Ελλάδα	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/gr/	*
dc.subject	Μεταφορικά γλωσσικά φαινόμενα	el
dc.subject	Βαθιά μάθηση	el
dc.subject	Μηχανική μάθηση	el
dc.subject	Ειρωνεία	el
dc.subject	Σαρκασμός	el
dc.subject	Figurative language	en
dc.subject	Irony	en
dc.subject	Sarcasm	el
dc.subject	Deep learning	el
dc.subject	Machine learning	el
dc.title	Ανίχνευση μεταφορικών γλωσσικών φαινομένων με την χρήση Συνδυασμού Βαθιών Νευρωνικών δικτύων	el
dc.title	Figurative language detection using Deep Ensemble Classifier	en
heal.type	bachelorThesis
heal.classification	Τεχνητή νοημοσύνη	el
heal.classification	Επεξεργασία φυσικής γλώσσας	el
heal.classification	Natural language processing	en
heal.classification	Artificial intelligence	el
heal.language	el
heal.access	free
heal.recordProvider	ntua	el
heal.publicationDate	2018-10-03
heal.abstract	Αντικείμενο της διπλωματικής εργασίας είναι η ανάπτυξη μοντέλων αναγνώρισης μεταφορι- κών γλωσσικών φαινομένων(ΜΓΦ) με τεχνικές βαθιάς μηχανικής μάθησης (Deep Learning). Το πρόβλημα της αναγνώρισης και κατάταξης ΜΦΓ αποτελεί ένα ανοιχτό πρόβλημα της Συναισθηματικής Ανάλυσης στο πεδίο της Επεξεργασίας Φυσικής Γλώσσας λόγω της νοηματικής αντίθεσης που περιέχεται σε αυτά. Το πρόβλημα αυτό αποτελείται από την αναγνώριση τριών αλληλένδετων ΜΓΦ: του σαρκασμού, της ειρωνείας και της μεταφοράς, τα οποία, στα πλαίσια της παρούσας εργασίας, αντιμετωπίζονται με προηγμένες τεχνικές βαθείας μηχανικής μάθησης (RNN, LSTM) και με τεχνικές μηχανισμών διανυσματικής υποστήριξης (SVM). Αρχικά, διευρευενούνται μέσω εκτεταμένης βιβλιογραφικής έρευνας οι τεχνολογίες αιχμής (stat-of-the-art) και οι ερευνητικές εξελίξεις στην ανίχνευση και αναγνώριση ΜΓΦ και κατα- γράφονται οι σημαντικότερες προσεγγίσεις. Στην ανασκόπηση αυτή, δίνεται ιδιαίτερη έμφαση τόσο στην μέθοδο εξαγωγής χαρακτηριστικών όσο και στους αλγορίθμους μηχανικής μάθη- σης που χρησιμοποιούνται. Στην συνέχεια περιγράφονται συνοπτικά οι βασικές θεωρητικές αρχές πάνω στις οποίες βασίζεται η προτεινόμενη αντιμετώπιση του προβλήματος. Στην συνέχεια αναπτύσσεται το πλαίσιο και το στάδιο προεπεξεργασίας των σχετικών δεδομένων (από κοινωνικά δίκτυα, -tweets)με σκοπό τη βέλτιστη προετοιμασία τους πριν ει- σαχθούν στα μοντέλα βαθιάς μηχανικής μάθησης. Επιπρόσθετα, εξάγονται από τα δεδομένα χαρακτηριστικά που μπορούν να διαχωριστούν σε τέσσερις κατηγορίες: τα συντακτικά, εκ- φραστικά, συναισθηματικά και ψυχολογικά, καθένα από τα οποία αποτυπώνει πτυχές για την μέθοδο γραφής και εκφοράς λόγου του χρήστη των κοινωνικών δικτύων. Τέλος, δημιουργείται ένα πρωτότυπο μοντέλο Deep Ensemble Soft Classifier-DESC, που συνδυάζει αλγορίθμους βαθιάς μάθησης. Χρησιμοποιώντας τέσσερα διαφορετικά σύνολα δε- δομένων αναφοράς (benchmark data), από γνωστά και διαδεδομένα συνέδρια και σχετικούς διαγωνισμούς (Semantic Evaluation-SemVal), και εξαντλητική αξιολόγηση της ικανότητας αναγνώρισης, διακρίνουμε πως το μοντέλο DESC επιτυγχάνει πολύ καλή συμπεριφορά, άξια σύγκρισης με σχετικές μεθοδολογίες και τεχνολογίες αιχμής στο προκλητικό πεδίο της ανα- γνώρισης ΜΓΦ.	el
heal.abstract	The subject of the diploma thesis is the development of models for the recognition of figurative language (FL) utilizing deep learning techniques. The management, recognition and classification of FL is an open problem of Sentiment analysis in the broader field of natural language processing (NLP) due to the contradictory meaning contained in phrases with metaphorical content. The problem itself represent three interrelated FL recognition tasks: sarcasm, irony and metaphor which, in the present work, are dealt with advanced deep learning (Recurrent Neural Networks, LSTM) and support vector machine (SVM) techniques. Initially, the state-of-the-art technologies in the field of FL detection and recognition are being explored through extensive bibliographical research, and the most important approaches are documented. The emphasis of the review is placed on both the feature extraction methodologies and the machine learning algorithms being utilized. In the sequel, the basic theoretical principles and techniques, on which the proposed approach is based, are presented. Next, the prepossessing framework of the relevant social-media data (tweets) is pre- sented. Data prepossessing aims towards efficient data representation formats so that to optimize the respective inputs to the deep learning models. In addition, special features are extracted from the data in order to characterize the syntactic, expressive, emotional and temper content reflected in the respective social media text references. These features aim to capture aspects of the social network user’s writing method. Finally, a prototype, Deep Ensemble Soft Classifier-DESC is created which, is based on the combination of different deep learning techniques. Using four different sets of benchmark data-sets, from well-known and widespread conferences and related contests, and based on the assessment of the performance of different FL recognition approaches, we conclude that the DESC model achieves a very good performance, worthy of comparison with relevant methodologies and state-of-the-art technologies in the challenging field of FL recognition.	en
heal.advisorName	Σταφυλοπάτης, Ανδρέας-Γεώργιος	el
heal.advisorName	Stafylopatis, Andreas Georgios	en
heal.committeeMemberName	Στάμου, Γεώργιος	el
heal.committeeMemberName	Τσανάκας, Παναγιώτης	el
heal.academicPublisher	Εθνικό Μετσόβιο Πολυτεχνείο. Σχολή Ηλεκτρολόγων Μηχανικών και Μηχανικών Υπολογιστών. Τομέας Συστημάτων Μετάδοσης Πληροφορίας και Τεχνολογίας Υλικών. Εργαστήριο Ευφυών Επικοινωνιών και Δικτύων Ευρείας Ζώνης	el
heal.academicPublisherID	ntua
heal.numberOfPages	96 σ.
heal.fullTextAvailability	true