Semi-automatic sentiment analysis: Results from a case study in Portuguese Web 2.0 sites

Gleicon Moraes, Marco Aurélio Gerosa

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Web 2.0 applications for social networking provide data about users' mood and opinions in almost real time. Many applications are taking advantage of these data to derive business intelligence. However, the volume of data makes it hard and error-prone to classify sentiments and opinions manually. The combination of data mining techniques and a pipeline to process data from Web 2.0 applications, such as Twitter, Facebook, and Wordpress, makes it possible to apply natural language processing and machine learning techniques to automate partially this task. Therefore, the amount of manual classification is reduced, as the incoming data has already a classification tag that can be easily changed, feeding back the classifier. There is room for improvements and a Brazilian Portuguese Corpus was created to do the initial training of the classifier. The code used for this testing was based on open source libraries and is available as a test bed for different corpora and new algorithms.

Original languageEnglish (US)
Title of host publicationProceedings of the IADIS International Conference WWW/Internet 2011, ICWI 2011
EditorsLuis Rodrigues, Bebo White, Pedro Isaias, Flavia Maria Santoro
PublisherIADIS
Pages443-447
Number of pages5
ISBN (Electronic)9789898533012
StatePublished - 2011
Externally publishedYes
EventIADIS International Conference WWW/Internet 2011, ICWI 2011 - Rio de Janeiro, Brazil
Duration: Nov 5 2011Nov 8 2011

Publication series

NameProceedings of the IADIS International Conference WWW/Internet 2011, ICWI 2011

Conference

ConferenceIADIS International Conference WWW/Internet 2011, ICWI 2011
Country/TerritoryBrazil
CityRio de Janeiro
Period11/5/1111/8/11

ASJC Scopus subject areas

  • Signal Processing
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Semi-automatic sentiment analysis: Results from a case study in Portuguese Web 2.0 sites'. Together they form a unique fingerprint.

Cite this