SABINE

When citing please use:
Castano, S., Ferrara, A., Gallinucci, E., Golfarelli, M., Montanelli, S., Mosca, L., Rizzi, S., Vaccari, C.: Sabine: A multi-purpose dataset of semantically-annotated social content (2018), big.csr.unibo.it/sabine.

PURL: http://purl.org/sabine

SABINE (SociAl Business INtelligence bEnchmark) is a multi-purpose dataset for Social Business Intelligence (SBI) in the domain of European politics. SABINE includes 6 millions bilingual clips crawled from almost 50 000 web sources, each associated with metadata and sentiment scores; an ontology with 400 topics, their occurrences in the clips, and their mapping to DBpedia; and two multidimensional cubes for analyzing and aggregating sentiment and occurrences.

SABINE is designed and properly packaged for modular download, to enable the evaluation of a wide variety of social business intelligence research tasks, either separately or in combination, ranging from those more focused on content analysis, to those related to semantic analysis up to more comprehensive social business analytics. Download links are provided after picture.

  • Clips (6.06 GB; unzipped 36.5 GB) - Download
    • Italian Clips (2.05 GB; unzipped 12.4 GB) - Download
      • Italian Clips with validated sentiment (166 KB; unzipped 1 MB) - Download
    • English Clips (4.01 GB; unzipped 24.1 GB) - Download
      • English Clips with validated sentiment (168 KB; unzipped 1 MB) - Download
  • Crawler Annotations (189 MB; unzipped 2.52 GB) - Download
    (Note: package Clips is also required to use this package)
  • Sentiment (15 MB; unzipped 83 MB) - Download
    (Note: package Clips is also required to use this package)
  • Topic occurrences (102 MB; unzipped 331 MB) - Download
    (Note: packages Clips and Topic Ontology are also required to use this package)
  • Topics & mappings (608 KB) - Download
    • Topic Ontology (56 KB) - Download
    • Linked DBpedia Resources (546 KB) - Download
      (Note: package Topic Ontology is also required to use this package)
    • Inter Language Mappings (6 KB) - Download
      (Note: package Topic Ontology is also required to use this package)
  • MD cubes (1.18 GB; unzipped 8.14 GB) - Download
    • Sentiment Cube (315 MB; unzipped 3.25 GB) - Download
    • Semantic Occurrence Cube (896 MB; unzipped 4.89 GB) - Download
    • Inquiries (372 KB) - Download
      (Note: packages Sentiment Cube and Semantic OccurrenceCube are also required to use this package)

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.