Benchmark
An open evaluation harness for Irish speech recognition. Same held-out audio, same scoring, for every system. We would rather publish a hard number than a press-release adjective.
Low-resource speech research is full of numbers that cannot be compared, because everyone evaluates on different audio with different scoring. BlasBench fixes the audio, the reference transcripts, and the metric, so a word error rate (WER) reported for one model means the same thing as a WER reported for another.
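For readers new to the metric, here is a minimal sketch of word error rate as it is conventionally defined: the number of word substitutions, deletions, and insertions needed to turn the system output into the reference, divided by the number of reference words. This is the textbook calculation, not BlasBench's own scoring code. The function name `word_error_rate` is illustrative, and the sketch assumes both transcripts have already been normalised (casing, punctuation, and so on), a step this page does not describe.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Standard WER: word-level edit distance divided by reference length.

    Assumes both strings are already normalised; tokenisation here is a
    plain whitespace split, which is an assumption of this sketch.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] is the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # i deletions needed to match an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(
                min(
                    prev[j] + 1,                      # deletion
                    curr[j - 1] + 1,                  # insertion
                    prev[j - 1] + substitution_cost,  # match or substitution
                )
            )
        prev = curr

    return prev[-1] / len(ref)


if __name__ == "__main__":
    reference = "tá an lá go breá"
    hypothesis = "tá an lá breá"  # one word dropped
    print(word_error_rate(reference, hypothesis))
```

Because the denominator is the reference length, two systems can only be compared fairly when they are scored against the same reference transcripts, which is the point of fixing them in the harness. Note also that WER can exceed 1.0 when a system inserts many extra words.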
We hold our own recogniser, Blas Voice, to this bar before it goes near a learner. See the research notes for how it fits the wider speech effort.