Close

Transliteration protocol ideas

A project log for Tactile Ear

The translation of phonemic sounds of speech in to tactile feedback.

revisionReVision 04/05/2019 at 18:560 Comments

My concept is to break language in to 4 groups of 8 phonemes. Additional sounds would be created by combining these like you would colours, for example the word 'of' would begin with the 'o' sound, and 'us' would begin with the 'u' sound, but the vowels in the word 'boot' would trigger both the 'o' and 'u''.

My example of how the channels are broken down.

12345678
Soft consonants
Quiet
C/S(+ch/sh) F/Ph/Gh/JHThWY**
Soft consonants
Loud
LMNRVXZ*
Hard consonants

BC/K/QDGJPT*
Vowels

AEIOU***

*denotes spare channels, which would probably be assigned later, or used as a modifier like an accent.

The output box would be fed via a serial stream at a suitable speed that could be determined by experiment, but as fast speech is 10 syllables per second, then a sample rate of 50 phonemes per second should be the bare minimum.

If the signal running as 1 bit per phoneme then each sample would be 32bit uncompressed, with each phoneme type being represented by 2 hexadecimal digits

in this instance the word 'wigwam' would be

00001000 00000000 00000000 00000000 #08000000 W

00000000 00000000 00000000 00100000 #00000020 "I"

00000000 00000000 00010000 0000000 #00002000 "G"

00001000 00000000 00000000 00000000 #08000000 W

00000000 00000000 00000000 10000000 #00000080 "A"

00000000 01000000 00000000 00000000 #00400000 "M"

As a scripted file, in the manner of standard subtitle files each line starts as follows...

hh:mm:ss.mms

hh is hours 
mm is minutes
ss is seconds
mms is milliseconds

so the above script would be...

00:00:00.00 #08000000
00:00:00.10 #00000020
00:00:00.20 #00002000
00:00:00.30 #08000000
00:00:00.40 #00000080
00:00:00.50 #00400000
00:00:00.55 #00000000

the last part being return to silence.

Possible pattern of contact points.

Improved pattern of contact points.

Any thought would be appreciated.

Discussions