The result of a simple experiment with an artificial corpus (modified from the previous labeling experiment, adding one-word utterances such as 'illo' and sentences about the ambience such as 'ilfacefrigide' ("It's cold.")):
lo,778 il,531* loes,456 illo,301* le,271* un,268* illoes,262 angulo,261* au,242 ta,204 re,188 as,176 verde,171* esun,160 blau,157* rubie,156* ilface,153 circulo,143* rectangulo,132* loesverde,131 triangulo,129* anguloes,125 loesblau,122 loesrubie,120 lor,104 tomas,104* illoesun,83 ilfacefrigide,80
....* Intended words are marked with '*'.
* The numbers are the frequency of strings in the corpus of 1000 utterances.