Υλοποίηση Αλγόριθμου Συστάσεων που με χρήση Τριγραμμάτων

Βιόλος, Ιωάννης Α.; Violos, Ioannis A.

dc.contributor.advisor	Βαρβαρίγου, Θεοδώρα	el
dc.contributor.author	Βιόλος, Ιωάννης Α.	el
dc.contributor.author	Violos, Ioannis A.	en
dc.date.accessioned	2014-04-01T08:06:40Z
dc.date.available	2014-04-01T08:06:40Z
dc.date.copyright	2013-12-17	-
dc.date.issued	2014-04-01
dc.date.submitted	2013-12-17	-
dc.identifier.uri	https://dspace.lib.ntua.gr/xmlui/handle/123456789/38273
dc.identifier.uri	http://dx.doi.org/10.26240/heal.ntua.16871
dc.description	166 σ.	el
dc.description.abstract	Τα Κοινωνικά Δίκτυα είναι ένας διαρκώς αυξανόμενος επιστημονικός κλάδος. Υλοποιούνται προγραμματιστικά με έναν συνδυασμό Linux, Apache, MySQL και PHP/Python/Perl (LAMP) και συνδυάζουν τεχνολογίες όπως Στοιχεία Αξιολόγησης, Συνεργατικές Ετικέτες, Κοινωνικοί Σελιδοδείκτες, Επεξεργασία Κοινωνικών Πληροφοριών, Συσχετισμός Πελατών, Εξατομικευμένο Marketing, Συστήματα Υπόληψης, Αυτοματοποιημένες Περιλήψεις, Συστήματα Συστάσεων. Εμείς ασχοληθήκαμε αναλυτικά με τα Συστήματα Συστάσεων. Δηλαδή αλγορίθμους και μεθόδους που μπορούν να κάνουν συστάσεις σε χρήστες. Οι συστάσεις μπορούν να είναι άλλοι χρήστες, κάποιο προϊόν, υπηρεσία, δημοσιογραφικά άρθρα και γενικώς οτιδήποτε μπορεί να ενδιαφέρει έναν χρήστη. Διάσημοι τέτοιοι αλγόριθμοι είναι ο αλγόριθμος που χρησιμοποιεί Συνεργατικά Φίλτρα όπου ο σκοπός του είναι να δημιουργήσουμε προφίλ χρηστών για τις προτιμήσεις τους καθώς και σχέσεις μεταξύ χρηστών. Η ιδέα βασίζεται στην αρχή ότι αυτό που ενδιέφερε τους φίλους μου πολύ πιθανόν να ενδιαφέρει και εμένα. Τα Φίλτρα που Βασίζονται στο Περιεχόμενο χρησιμοποιούν την περιγραφή ενός αντικειμένου αλλά και το προφίλ ενδιαφερόντων που έχει κάθε χρήστης. Η ιδέα βασίζεται στο ότι ένας χρήστης έχει ένα πλήθος ενδιαφερόντων και κάθε αντικείμενο ικανοποιεί ένα πλήθος ενδιαφερόντων. Όσο πιο πολύ ταυτίζονται αυτά τα δύο τόσο πιο μεγάλο θα είναι το ενδιαφέρον του χρήστη προς το αντικείμενο αυτό. Άλλα Συστήματα συστάσεων κάνουν χρήση τεχνικών Εξόρυξης Γνώσης, Παραπομπές, χρήση του Αλγόριθμου k Κοντινότερων Γειτόνων. Άλλα βασίζονται στο πόση Ώρα κάθε χρήστης είδε μια ιστοσελίδα, σε Ετικέτες, Συσχετιστικούς Κανόνες, υβρίδια μεταξύ όλων αυτών και σε συστάσεις που χρησιμοποιούν την Αυτοματοποιημένη Περίληψη Πολλών Κειμένων μία μέθοδο στην οποία μπορεί να παραχθεί περίληψη από πολλά κείμενα που έχει γράψει κάθε χρήστης και έπειτα να συγκριθούν αναμεταξύ τους. Το Σύστημα συστάσεων που υλοποιήσαμε βασίζεται στα κείμενα που δημοσιεύουν οι χρήστες. Από καθενός χρήστη τα κείμενα παράγεται ένας γράφος. Οι κόμβοι αυτού του γράφου θα είναι τα τριγράμματα που βρίσκονται στα αρχικά κείμενα και οι ακμές δηλώνουν του κατά πόσο αυτά τα τριγράμματα είναι κοντά αναμεταξύ τους στα αρχικά κείμενα. Αν συγκρίνουμε τους γράφους που έχουν παραχθεί για κάθε χρήστη θα έχουμε μια ένδειξη του κατά πόσο έχουν κοινά ενδιαφέροντα αναμεταξύ τους οι χρήστες. Υψηλή ομοιότητα γράφων θα συνεπάγεται ότι μοιράζονται πολλά κοινά ενδιαφέροντα. Οι μέθοδοι σύγκρισης γράφων που χρησιμοποιήσαμε ήταν η ομοιότητα περιεχομένου, όπου μας νοιάζει η ομοιότητα του γράφου χωρίς να παίρνουμε υπόψιν το βάρος των ακμών και η ομοιότητα κατά αξία όπου προσμετράται και κατά πόσο δύο κοινές ακμές έχουν το ίδιο βάρος. Τα αποτελέσματα που βρήκαμε ήταν αρκετά ικανοποιητικά σε δοκιμές που έγιναν στο τουίτερ. Στις περισσότερες περιπτώσεις είχαμε υψηλή ομοιότητα μεταξύ χρηστών που ασχολούνται με κοινά θέματα. Χαμηλή σε χρήστες που ασχολούνται με διαφορετικά θέματα και τέλος προβλέφθηκε η σχέση μεταξύ χρηστών που ασχολούνται με συγγενικά θέματα.	el
dc.description.abstract	Social Networks is an increasing scientific field . We can implement it with a combination of Linux, Apache, MySQL and PHP / Python / Perl (LAMP) and combine technologies such as Rating Features, Collaborative Tagging, Social Bookmarking, Social Information Processing, Customer Engagement, Personalized Marketing, Reputation Systems, Automated Summaries, Recommendation Systems . We dealt in detail with Recommendation Systems . Algorithms and methods that can make recommendations to users . Recommendations may be other users , any product, service, newspaper article and generally anything that might interest a user . Famous algorithm is the algorithm that uses Collaborative Filters where the purpose is to create user profiles for their preferences and relationships between users. The idea is based on the principle that me and my friends share the same interests. The Content Based methos combines the description of an object and the profile of interests that each user has . The idea is that a user has a number of interests and each object has a number of interests that satisfies. As much matching these two the greater will be the user's interest in that subject . Other Recommendation systems use Data Mining techniques , Referral Web, using the kNearest Neighbor algorithm. Others based on the Time View, in Tags , Referral Web , hybrids between the previous and the MultiDocument Summarization which is a method to be produced a summary of many texts which have been written by users and then be compared against each other . The recommendation system which we implemented is based on texts by users . A graph is produced by each user texts . The nodes of this graph is the trigrams that found in the original texts and edges indicate whether these Trigrams are near each other in the early texts. If we compare the graphs generated for each user we will have a prediction if there are common interests among the users . High similarity between graphs would imply that the users share many common interests. The Comparison Methods of Graph which we used was the Content Similarity , where we compute the similarity of the graph without taking into account the weight of the edges and the value similarity where it computed if two common edges have the same weight. The results in tests conducted in tweeter were pretty satisfactory. In most cases we had high similarity between users dealing with common issues. Low similarity between users dealing with different issues and finally our method predicted the relationship between users dealing with related issues.	en
dc.description.statementofresponsibility	Ιωάννης Α. Βιόλος	el
dc.language.iso	el	en
dc.rights	ETDFree-policy.xml	en
dc.subject	Κοινωνικά δίκτυα	el
dc.subject	Σύστημα συστάσεων	el
dc.subject	Τριγράμματα	el
dc.subject	Σύγκριση γράφων	el
dc.subject	Ομοιότητα περιεχομένου	el
dc.subject	Ομοιότητα κατά αξία	el
dc.subject	Συνεργατικά φίλτρα	el
dc.subject	Φίλτρα που βασίζονται στο περιεχόμενο	el
dc.subject	Τουίτερ	el
dc.subject	Αυτοματοποιημένη περίληψη πολλών κειμένων	el
dc.subject	Social networks	en
dc.subject	Recommendation system	en
dc.subject	Trigram	en
dc.subject	Comparing graphs	en
dc.subject	Containment similarity	en
dc.subject	Value similarity	en
dc.subject	Collaborative filtering	en
dc.subject	Content based filtering	en
dc.subject	Twiiter	en
dc.subject	MultiDocument summarization	en
dc.title	Υλοποίηση Αλγόριθμου Συστάσεων που με χρήση Τριγραμμάτων	el
dc.title.alternative	Algorithm Implementation of a Recommendation System which uses threegrams	en
dc.type	bachelorThesis	el (en)
dc.date.accepted	2013-12-16	-
dc.date.modified	2013-12-17	-
dc.contributor.advisorcommitteemember	Καγιάφας, Ελευθέριος	el
dc.contributor.advisorcommitteemember	Λούμος, Βασίλειος	el
dc.contributor.committeemember	Βαρβαρίγου, Θεοδώρα	el
dc.contributor.committeemember	Καγιάφας, Ελευθέριος	el
dc.contributor.committeemember	Λούμος, Βασίλειος	el
dc.contributor.department	Εθνικό Μετσόβιο Πολυτεχνείο. Σχολή Ηλεκτρολόγων Μηχανικών και Μηχανικών Υπολογιστών. Τομέας Επικοινωνιών, Ηλεκτρονικής και Συστημάτων Πληροφορικής.	el
dc.date.recordmanipulation.recordcreated	2014-04-01	-
dc.date.recordmanipulation.recordmodified	2014-04-01	-