Monitoring first broadcast then print media over the last 70 years, nearly half of the annual output of Western intelligence global news monitoring is now derived from Internet–based news, standing testament to the Web’s disruptive power as a distribution medium. Pooling together the global tone of all news mentions of a country over time appears to accurately forecast its near–term stability, including predicting the revolutions in Egypt, Tunisia, and Libya, conflict in Serbia, and the stability of Saudi Arabia. Location plays a critical role in news reporting, and “passively crowdsourcing” the media to find the locations most closely associated with Bin Laden prior to his capture finds a 200km.–wide swath of northern Pakistan as his most likely hiding place, an area which contains Abbottabad, the city he was ultimately captured in. Finally, the geographic clustering of the news, the way in which it frames localities together, offers new insights into how the world views itself and the “natural civilizations” of the news media.
While heavily biased and far from complete, the news media captures the only cross–national real–time record of human society available to researchers. The findings of this study suggest that Culturomics, which has thus far focused on the digested history of books, can yield intriguing new understandings of human society when applied to the real–time data of news. From forecasting impending conflict to offering insights on the locations of wanted fugitives, applying data mining approaches to the vast historical archive of the news media offers promise of new approaches to measuring and understanding human society on a global scale
http://www.carboncapturereport.org/ [ University of Illinois ] vía Monday Reading @fernand0
análisis del valor en reputación positiva o negativa según los términos contenidos en noticias, menciones…
en minería de grandes bases de datos por sentiment mining (reputación positiva o negativa según diccionarios de términos de uno y otro siglo renovados en los últimos años) + full text geocoding.
although the methodology is language-focused it allows the inclusion of aspects of communication or context which are specifically related to computer-mediated communication.
Entre las herramientas semánticas que vienen vamos a encontrar analizadores de términos frecuentes y de etiquetas que se relacionan con las conversaciones de otros. Sobre Twitter y Facebook no paran de salir. He querido perder un rato con una buscando cómo clasifica las prácticas de la microcomunicación en estas redes.
Encuentro simpáticos los adjetivos con que nos clasifica la matriz de influencia de Klout. Casi parece un horóscopo. Se organiza entorno a cuatro ejes delimitados por las oposiciones: compartir / crear, amplio / focalizado, ocasional o consistente, escucha / participación.
1. En el eje de la aportación de contenidos a públicos La figura del alimentador (Feeder) aporta información a un sector. La del sindicador (Syndicator) selecciona temas o según interés de un público concreto. El difusor (Broadcaster) ofrece más contenido a audiencias también amplias. Y por fin, el curador (Curator) se describe como un administrador experimentado en muchas fuentes y con numerosos lectores.
2. En el eje de la orientación a públicos de contenidos, sitúa abajo al l líder intelectual (Thought Leader) no sólo es conocedor de su campo; también es seguido por su opinión de la realidad presente al alcance de unos lectores que conoce bien. El “degustador” (Taste Maker) es reconocido como indicador de tendencias para un público. El experto (Pundit) es una voz autorizada y ampliamente reconocida en un sector. El famoso (Celebrity) es centro de miradas, foco de atención con el máximo efecto amplificador.
3. En el eje de la focalización y consistencia de los contenidos: El / la socializador/a (Socializer) es una persona de comunidad apreciada por su activa generosidad. El 7 la coordinador de red (Networker) conoces sus grupos y colabora con ellos. A la persona activista se la relaciona además con una causa concreta. Y un especialista obtiene un alto grado de confianza en la selecta audiencia de unos contenidos precisos.
4. Desde la observación distante y una baja atención se definen las figuras periféricas de la conversación en en Twitter y Facebook que analiza Klout. En su punto más activo con la figura de conversador se relaciona con la información de primera mano y precisamente contada. El/la aficionado/a busca en quienes apoyarse para desarrollar su red. El/a explorador/a usa e innova para seguir con las evoluciones actuales. La figura del observador ocupa una posición frenada en la escucha para decidir actuaciones próximas en las redes.
It’s one of those social phenomena that has so embedded itself in the culture that we don’t even notice it. It developed its own syntax, its own meaning, and even shifted the boundaries of cultural mores and social intercourse. Even I didn’t realize it was so widespread until I started researching this article. And yet, at least in the middle of the decade, it spanned all continents and was accounting for more than half of cellphone traffic in many developing countries.
(…)
…the missed call is not some reflection of not having enough credit. Its a medium of exchange of complex messages that has become surprisingly refined in a short period. Much of it is not communication at all, at least in terms of actual information. The interaction is the motivation, not the content of the message itself. Or, as a Filipino professor, Adrian Remodo put it to a language conference in Manila in 2007 at which they voted to make miscall, or miskol in Tagalog, the word of the year: A miskol is often used as “an alternative way to make someone’s presence felt.
undergoing a set of complex vetting procedures, involving authorities such as publishing houses, editors, librarians and academics (see Ryder and Wilson, 1996). By comparison, web space is open, unstructured, and quintessentially anarchic. The scholarly sits side-by-side with the journalistic, the institutional with the personal, the factual with the fictitious. Geographical origin, authorship and communicative intent (and thus genre) are notoriously difficult to establish:
We all know (and may ourselves have voiced) the complaints about online information: there is too much ephemeral content of dubious reliability: journalistic, commercial and personal texts of unknown authorship and authority abound; assertions are intermingled with and represented as established fact, and details of sources and research methodology are documented haphazardly at best. (Fletcher, 2001: 10)
This lack of pre-ordering, and the indiscriminate mixing of voices and genres, probably goes quite a long way towards explaining why critical discourse analysts are often reluctant to mine the web for data. In the absence of gatekeepers, who structure and vet content in the traditional media, the onus falls on the researcher to establish the nature of the data that search engines have laid before him or her, and to select those sources that will be useful in answering specific research questions.