HEAL DSpace

Semantic segmentation for ADL in assistive robotics and comparison of RGB/RGB-D input and Single/Multiple viewpoints

Αποθετήριο DSpace/Manakin

Εμφάνιση απλής εγγραφής

dc.contributor.author Οικονόμου, Αικατερίνη Μαρία el
dc.contributor.author Oikonomou, Aikaterini Maria en
dc.date.accessioned 2021-04-19T14:43:28Z
dc.date.available 2021-04-19T14:43:28Z
dc.identifier.uri https://dspace.lib.ntua.gr/xmlui/handle/123456789/53356
dc.identifier.uri http://dx.doi.org/10.26240/heal.ntua.21054
dc.description Εθνικό Μετσόβιο Πολυτεχνείο--Μεταπτυχιακή Εργασία. Διεπιστημονικό-Διατμηματικό Πρόγραμμα Μεταπτυχιακών Σπουδών (Δ.Π.Μ.Σ.) “Συστήματα Αυτοματισμού”
dc.rights Default License
dc.subject Semantic segmentation en
dc.subject Computer vision en
dc.title Semantic segmentation for ADL in assistive robotics and comparison of RGB/RGB-D input and Single/Multiple viewpoints en
heal.type masterThesis
heal.classification Computer vision en
heal.language el
heal.language en
heal.access free
heal.recordProvider ntua el
heal.publicationDate 2021-02-01
heal.abstract Computer vision, the field of artificial intelligence that develops computational algorithms to interpret and understand the visual world, has been studied in many perspectives. It expands from raw data recordings into techniques and ideas combining digital image processing, pattern recognition, machine learning and computer graphics. As its extensive usage has attracted many scholars to integrate with many disciplines and fields, in assistive robotics, computer vision aims to develop systems to help robots understand images in a similar way humans do. In this thesis, we focus on semantic segmentation, the process of labelling each area–or pixel– of a digital image according to a representation class. We make use of deep learning methods, which, over the last years, have been shown to outperform previous state-of-the-art machine learning techniques in several fields, with computer vision being one of the most prominent cases. Deep learning models can be trained to identify and classify objects in an image, and, in particular, Convolutional Neural Networks (CNN), a class of deep neural networks most commonly used in visual imagery, has been shown to be able to extract important visual features that resemble the ones generated in the visual cortex of humans. In particular, we develop a novel convolutional neural network architecture for semantic segmentation in the framework of Activity of Daily Life (ADL), with focus on semantic segmentation of images in a breakfast scenario. We use RGB and RGB-D images as input signals (images with 4 channels: RGB colors and Depth) and quantify the benefits of the additional information of depth in object classification. As a next step, we evaluate the performance of our deep neural network when trained with data from multiple viewpoints. We make use of two cameras pointing on the table with a different field of view. The first camera is placed on a headset (user’s point of view) and the second on the table.Furthermore, for this project we create a new dataset using two Intel RealSense camera sensors, in the apparatus mentioned above. This dataset is used to train three different convolutional neural networks for semantic segmentation. The first two CNNs are trained with the dataset collected from the headset (one with RGB and one with RGB-D information). The third CNN is trained with the dataset collected from the second camera, and the last one with the dataset from both camera views. Quantifying the importance of multiple viewpoints and depth information in semantic segmentation will help in developing robust, fast, and more precise perception algorithms for object recognition in ADL applications. This is an important step towards the final goal of creating assistive robot agents (e.g. robotic arms, wheelchairs,and exoskeletons) that can help people with severe motor impairments enjoy their first meal of the day. en
heal.advisorName Kyriakopoulos, Kostas J. en
heal.advisorName Graeser, Axel en
heal.committeeMemberName Κυριακόπουλος, Κωνσταντίνος Ι. el
heal.committeeMemberName Τζαφέστας, Κωνσταντίνος Σ. el
heal.committeeMemberName Παπαδόπουλος, Ευάγγελος Γ. el
heal.academicPublisher Εθνικό Μετσόβιο Πολυτεχνείο. Σχολή Μηχανολόγων Μηχανικών el
heal.academicPublisherID ntua
heal.numberOfPages 46 σ. el
heal.fullTextAvailability false


Αρχεία σε αυτό το τεκμήριο

Αυτό το τεκμήριο εμφανίζεται στην ακόλουθη συλλογή(ές)

Εμφάνιση απλής εγγραφής