The result of a simple experiment with an artificial corpus (modified from the previous labeling experiment, adding one-word utterances such as 'illo' and sentences about the ambience such as 'ilfacefrigide' ("It's cold.")):
lo,778
il,531*
loes,456
illo,301*
le,271*
un,268*
illoes,262
angulo,261*
au,242
ta,204
re,188
as,176
verde,171*
esun,160
blau,157*
rubie,156*
ilface,153
circulo,143*
rectangulo,132*
loesverde,131
triangulo,129*
anguloes,125
loesblau,122
loesrubie,120
lor,104
tomas,104*
illoesun,83
ilfacefrigide,80
....
* Intended words are marked with '*'.* The numbers are the frequency of strings in the corpus of 1000 utterances.
No comments:
Post a Comment