Técnicas big data: Análisis de textos a gran escala para la investigación científica y periodística [Big data techniques: Large-scale text analysis for scientific and journalistic research] Academic Article

journal

  • Profesional de la Información

abstract

  • This paper conceptualizes the term big data and describes its relevance to social research and journalistic practice. We explain large-scale text analysis techniques, such as automated content analysis, data mining, machine learning, topic modeling, and sentiment analysis, which may aid scientific discovery in the social sciences and news production in journalism. We describe the e-infrastructure required for big data analysis using cloud computing, and we assess the main packages and libraries for information retrieval and analysis available in commercial software and in programming languages such as Python or R.

publication date

  • 2016-1-1

edition

  • 25

keywords

  • content analysis
  • data analysis
  • information retrieval
  • infrastructure
  • journalism
  • learning
  • news
  • programming language
  • research practice
  • social research
  • social science
  • software
  • text analysis

International Standard Serial Number (ISSN)

  • 1386-6710

number of pages

  • 9

start page

  • 623

end page

  • 631