Lexica and corpora for speech-to-speech translation components