SoBigData proposes to create the Social Mining & Big Data Ecosystem: a research infrastructure (RI) providing an integrated ecosystem for ethic-sensitive scientific discoveries and advanced applications of social data mining on the various dimensions of social life, as recorded by “big data”. SoBigData will open up new research avenues in multiple research fields, including mathematics, ICT, and human, social and economic sciences, by enabling easy comparison, re-use and integration of state-of-the-art big social data, methods, and services, into new research. It will not only strengthen the existing clusters of excellence in social data mining research, but also create a pan-European, inter-disciplinary community of social data scientists, fostered by extensive training, networking, and innovation activities.
In addition, as an open research infrastructure, SoBigData promotes repeatable and open science. Although SoBigData is primarily aimed at serving the needs of researchers, the openly available datasets and open source methods and services provided by the new research infrastructure will also impact industrial and other stakeholders (e.g. government bodies, non-profit organisations, funders, policy makers).
WHAT WE DO
SoBigData is a research infrastructure (RI) for ethic-sensitive scientific discoveries and advanced applications of social data mining to the various dimensions of social life, as recorded by “big data”.
The research community uses the SoBigData RI facilities as a “secure digital wind-tunnel” for large-scale social data analysis and simulation experiments.
SoBigData serves the wide cross-disciplinary community of data scientists, i.e., researchers studying all aspects of societal complexity from a data- and model-driven perspective, including data and text miners, visual analytics researchers, socio-economic scientists, network scientists, political scientists, humanities researchers, and more.
The RI provides the common ground for numerous H2020 projects aimed at tackling societal challenges like demographic change and well-being; secure, clean, efficient and integrated transport; Europe in the changing world; digital human.
THE SoBigData RI PILLARS
An ever-growing, distributed data ecosystem for procurement, access and curation of big social data, to underpin social data mining research within an ethic-sensitive context. This will be based on innovative strategies for acquiring social big data for research purposes, using both opportunistic means offered by social sensing technologies and participatory means based on user involvement as prosumers of social data and knowledge.
An ever-growing, distributed platform of interoperable, social data mining methods and associated skills: tools, methodologies and services for mining, analysing, and visualising complex and massive datasets, harnessing the techno-legal barriers to the ethically safe deployment of big data for social mining.
Transnational and Virtual Access
A starting community of scientific, industrial, and other stakeholders (e.g. policy makers), supported by transnational and virtual access activities, and brought together by extensive networking and innovation actions (e.g. workshops, summer schools, datathons, training resources in social data mining, knowledge transfer, industrial partnerships). In particular, the training events and resources are aimed at creating a new generation of multi-disciplinary social data scientists, with focus on early career researchers and promoting gender balance.
RESEARCH: SIX THEMATIC CLUSTERS
SoBigData open up new research avenues in multiple research fields, including mathematics, ICT, and human, social and economic sciences, by enabling easy comparison, re-use and integration of state-of-the-art research data, methods, and services, into new research. It will contribute to the creation of a new community of researchers – data scientists – ready to exploit the opportunities of big data and to incorporate it in data-driven science and innovation. In addition, as an open research infrastructure, SoBigData will promote repeatable and open science.
text and social media mining