SyncOccur corpus

The SyncOccur corpus is introduced in the following paper:

  • [2013,inproceedings] bibtex
    M. Ramona and G. Peeters, "AudioPrint: an efficient audio fingerprint system based on a novel cost-less synchronization scheme," in Proc. ICASSP '13, 2013.
    @inproceedings{bibA14,
      author = {Mathieu Ramona and Geoffroy Peeters},
      Month = {May 26-31},
      Title = {AudioPrint: an efficient audio fingerprint system based on a novel cost-less synchronization scheme},
      Year = {2013},
      Booktitle = {Proc. {ICASSP} '13}
    }

Presentation

It consists of 10,000 pairs of 20-second audio chunks that are very precisely aligned (i.e. they remain simultaneous for their whole duration when played at the same time). Each pair also comes with a stereo mix of the two chunks. These pairs of audio samples were extracted from a real-world corpus of French radio broadcasts, using the method described in the following article and its complementary page about audio temporal alignment.

  • [2011,inproceedings] bibtex
    M. Ramona and G. Peeters, "Automatic alignment of audio occurrences: application to the verification and synchronization of audio fingerprinting annotation," in Proc. DAFX '11, 2011, pp. 429-436.
    @inproceedings{bibA11,
      author = {Mathieu Ramona and Geoffroy Peeters},
      Month = {September},
      Title = {Automatic alignment of audio occurrences: application to the verification and synchronization of audio fingerprinting annotation},
      Year = {2011},
      Booktitle = {Proc. {DAFX} '11},
      Pages = {429--436}
    }

To improve the durability and consistency of this corpus, we followed the documentation protocol advised in the recent paper by Peeters and Fort, which proposes a methodology for providing new corpora to the MIR community:

  • [2012,inproceedings] bibtex
    G. Peeters and K. Fort, "Towards a better definition of the description of annotated MIR corpora," in Proc. ISMIR '12, 2012.
    @inproceedings{bib509,
      author = {Geoffroy Peeters and Kar{\"e}n Fort},
      Title = {Towards a better definition of the description of annotated {MIR} corpora},
      Year = {2012},
      Booktitle = {Proc. {ISMIR} '12}
    }

Examples

As explained in the corpus XML description, each item comes in the form of three audio files:

  • The scaled file is the original item, as learnt in the database, after time-warping (hence the "scaled" attribute) to synchronize it with the stream file.
  • The stream file is the occurrence of the item in the test audio stream (radio broadcast recordings).
  • The mix file combines the two previous audio chunks in a stereo mix in order to assess the synchronicity.
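
As an illustration of how the three files relate, here is a minimal sketch in plain Python. It is not the method used to build the corpus: the function names `stereo_mix` and `best_lag` are hypothetical, the samples are assumed to be already decoded into lists of numbers at the same sampling rate, and the brute-force cross-correlation is only one simple way to check for a residual offset between a scaled chunk and its stream chunk.

```python
def stereo_mix(scaled, stream):
    """Pair up two mono sample sequences as (left, right) stereo frames."""
    n = min(len(scaled), len(stream))  # truncate to the shorter chunk
    return [(scaled[i], stream[i]) for i in range(n)]


def best_lag(scaled, stream, max_lag):
    """Return the lag in samples, within +/- max_lag, that maximizes the
    cross-correlation of the two chunks. A result of 0 means that no
    residual offset was found; a positive result means the stream chunk
    is late with respect to the scaled chunk."""

    def correlation(lag):
        total = 0.0
        for i, sample in enumerate(scaled):
            j = i + lag
            if 0 <= j < len(stream):
                total += sample * stream[j]
        return total

    return max(range(-max_lag, max_lag + 1), key=correlation)
```

For a well-synchronized pair, `best_lag` should return 0 or a value very close to it. The search is quadratic in the worst case, so `max_lag` should be kept small on full 20-second chunks.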

The table below provides a few examples of the corpus content.

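Item      Scaled (Mono)   Stream (Mono)   Mix (Stereo)
oc00002   (audio)         (audio)         (audio)
oc00003   (audio)         (audio)         (audio)
oc00004   (audio)         (audio)         (audio)
oc00005   (audio)         (audio)         (audio)
oc00006   (audio)         (audio)         (audio)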

Download

The XML description and annotations are freely available on GitHub at
https://github.com/hibooo/syncoccur.
Please contact the author (see contact panel on the right) to request the audio part of the corpus.