﻿Euroradio.BE-RU corpus is a sentence-aligned Belarusian-Russian parallel corpus derived from the news on euroradion.fm for the year 2016. Overall, it contains ca. 135K sentence pairs. The encoding used is UTF-8.

Both extraction and sentence alignment were done automatically, so there may be some processing mistakes.

The filenames follow the pattern:

  <year>-<month>.<language>

For example, "2016-11.be" is the Belarusian side of the part of the corpus taken from the November 2016 news.