[Home]

Summary:ASTERISK-05070: [patch] SayAlpha does not read special characteres such as "," and "/"
Reporter:Leo Burd (leoburd)Labels:
Date Opened:2005-09-13 06:41:07Date Closed:2007-11-20 17:26:38.000-0600
Priority:MinorRegression?No
Status:Closed/CompleteComponents:Core/General
Versions:Frequency of
Occurrence
Related
Issues:
Environment:Attachments:( 0) 5208.txt
( 1) chars.tar
Description:SayAlpha does not read common special characteres that are associated with email addresses and URLs.  It would be great if it could read characteres such as ":" ";" "_" "/" ","

Thanks,

Leo
Comments:By: Russell Bryant (russell) 2005-09-16 20:10:14

... Gastman?  Please be careful when filling out your bug report.  :)

By: Clod Patry (junky) 2005-09-16 21:29:55

if you have the latest say.c version,
" ", "/" is currently supported.

I just patch say.c
to add these characters:    _|\:;,

But for some, you'll have to add a \ before for the pbx_config to ignore these chars in the parsing (only the ,;\ )


exten => 5208,1,SayAlpha(_:/\\ab\;|w..co\,m/);
will gives you:
   -- Executing SayAlpha("SIP/10-c795", "_:/\ab;|w..co,m/") in new stack
   -- Playing 'letters/underscore' (language 'en')
   -- Playing 'letters/colon' (language 'en')
   -- Playing 'letters/slash' (language 'en')
   -- Playing 'letters/backslash' (language 'en')
   -- Playing 'letters/a' (language 'en')
   -- Playing 'letters/b' (language 'en')
   -- Playing 'letters/semicolon' (language 'en')
   -- Playing 'letters/pipe' (language 'en')
   -- Playing 'letters/w' (language 'en')
   -- Playing 'letters/dot' (language 'en')
   -- Playing 'letters/dot' (language 'en')
   -- Playing 'letters/c' (language 'en')
   -- Playing 'letters/o' (language 'en')
   -- Playing 'letters/comma' (language 'en')
   -- Playing 'letters/m' (language 'en')
   -- Playing 'letters/slash' (language 'en')


just untar chars.tar in sounds dir and report how it works.

Dont laught on my sexy voice and thanks to Ariel for that brief english lesson :)



By: Clod Patry (junky) 2005-09-17 12:34:16

Patch build on say.c Revision: 1.69
CVS-HEAD 2005-09-16 05:14:37 UTC

Disclaimer on file.

By: Clod Patry (junky) 2005-09-22 05:58:26

leoburd: could you confirm, it works fine for you?

By: Leo Burd (leoburd) 2005-09-23 09:46:58

Thanks so much for the patch.  It seems to be working just fine!

BTW, I think it would be good to apply the same patch to SayPhonetic.  Moreover, it's weird that SayDigits only reads symbols such as "*" and "-".  It should also read "/" and "+"...

All the best,

.L.

By: Clod Patry (junky) 2005-09-23 10:57:28

we'll need Allison for these prompts now:
- colon
- semicolon
- underscore
- comma
- pipe
- backslash

Is it possible to get all these prompts recorded?

By: Michael Jerris (mikej) 2005-12-01 13:51:39.000-0600

Can we please get an updated patch for this for current svn trunk and the additional prompts that will be needed.  Thanks.

By: Jason Parker (jparker) 2006-01-05 12:49:47.000-0600

If we're going to add these, would anybody be against adding a few more?

I'm thinking `~%^&()[]{}'"?<> and perhaps anything else that isn't on my US-centric keyboard.


Does anybody with some disposable cash have any extreme need for these?  I think that's all this issue is waiting on.  By my count, the attached patch and my proposed additional chars, are 42 "words".  Then again, I could be wrong...

By: Clod Patry (junky) 2006-01-06 10:27:32.000-0600

So far, we just need Allison to record prompts and i think that's it for the first step.

By: Matt O'Gorman (mogorman) 2006-01-10 16:28:18.000-0600

I will take care of the prompts, can some one please list all of the words that need to be done.

By: Clod Patry (junky) 2006-01-10 16:40:15.000-0600

see my note 0033842:
- colon
- semicolon
- underscore
- comma
- pipe
- backslash

By: Matt O'Gorman (mogorman) 2006-01-10 17:00:25.000-0600

i thought people also wanted this  `~%^&()[]{}'"?<> as well

By: Jason Parker (jparker) 2006-01-10 17:05:15.000-0600

That was more of a "well...while we're at it..." thing.

By: Clod Patry (junky) 2006-01-10 18:58:25.000-0600

just to make sure my english is okay here:
~ = tildard
% = percent
& = and
( = left braket
) = right braket

whats the other ones?



By: Jason Parker (jparker) 2006-01-10 21:30:13.000-0600

Of the ones I posted, you've got...(in order)

`~%^&()[]{}'"?<>

backtick, tilde, percent, caret, ampersand, left parenthesis (or open parenthesis), right parenthesis (or close parenthesis), left square brace, right square brace, left curly brace, right curly brace, apostrophe, quotation, question mark, left angle bracket (or less than), right angle bracket (or greater than)


Something like that anyhow...  I was also thinking about the pound character (not #)...Is there another name that could be used for that?

By: Clod Patry (junky) 2006-01-12 00:25:30.000-0600

Not that I know, I'll provide a patch soon, for SayAlpha, with your last note north.
Thanks.


Now the question is more: do we want it in SayPhonetic and SayDigits too?
By consequent, SayNumber should be done too.

By: Jason Parker (jparker) 2006-01-12 00:33:12.000-0600

I don't think saydigits needs it.  sayphonetics probably does though.

By: Matt O'Gorman (mogorman) 2006-01-12 09:30:27.000-0600

id have to say SayPhonetic  is best so if i am not mistaken the words we want are
colon                  :
semicolon              ;
underscore             _
dash                   -
comma                  ,
pipe                   |
backslash              \
back tick              `
tilde                  ~
percent                %
carrot                 ^
ampersand              &
open paranthese        (
closed paranthese      )
open bracket           [
closed bracket         ]
open curly brace       {
closed curly brace     }
single quote           '
double quote           "
question mark          ?
less than              <
greater than           >
slash                  /
at                     @
pound                  #
cash                   $
exclamation point      !


hmm thats all the symbols on my keyboard.... anyone else want to add any before i submit for prompts

By: Leo Burd (leoburd) 2006-01-12 14:58:15.000-0600

How do you think SayAlpha (and related commands) should read common foreign combinations such as ç, ã, õ, ñ, à, á, é, í, ü, ö and the like?  Should it just read like the letter associated with the special symbol, i.e. 'ã' should be read as 'a' and, 'ç' should be read as 'c'?

By: Clod Patry (junky) 2006-01-15 13:52:32.000-0600

I think if its an 'é', or an 'à', these should be stream like:
(in french)
e accent aigu
a accent grave, etc.

But these char only exists in french, spanish (and other languages), so maybe we could put a lot of these chars in that function or simply switch ast_say_character_str_full() to ast_say_character_str_full_fr(), ast_say_character_str_full_en(), etc, to be consistent with other functions in say.c.

By: Clod Patry (junky) 2006-02-13 23:16:50.000-0600

mogorman: Any news here?
Did you get any record yet?

That issue will be really easy to fix, the moment we'll get the record.

By: Matt O'Gorman (mogorman) 2006-02-22 13:15:46.000-0600

i have the recordings i just need to splice them, please follow through with the rest of the code junk-y

By: Clod Patry (junky) 2006-05-05 18:57:16

mogorman: feel free to place the prompts in the trunk, i'll update this patch when  i'll have final prompt.

By: Serge Vecher (serge-v) 2006-06-07 15:59:49

*ping*

By: Serge Vecher (serge-v) 2006-07-07 15:38:24

ping

By: zedders (zedders) 2006-07-09 10:17:41

As an extra, if you are able to organise recordings, could you get # recorded as 'hash' too for us British folk? :-)

By: Serge Vecher (serge-v) 2006-09-01 14:10:21

ping

By: jmls (jmls) 2006-10-31 12:48:24.000-0600

guys, we've been pinging this since June ...

By: jmls (jmls) 2007-01-07 03:05:57.000-0600

perhaps we should close this - no response since June '06 ...

By: Clod Patry (junky) 2007-02-18 00:40:48.000-0600

I vote for this one should not be closed, but process.
This is not hard, we just have to get the prompts (mogorman said he had it).
Can we just commit that ticket, which is opened for like 1.5 years.

By: Serge Vecher (serge-v) 2007-03-07 13:10:09.000-0600

Steve, since you've done an excellent job with the sounds and, I'm guessing, have access to mogorman's stuff, would you please take this abandoned issue under your wing?

By: Steve Murphy (murf) 2007-08-24 11:26:01

Thought I'd post my research on this issue; Kevin Fleming (who would OK digium spending on this) doesn't think there is demand enough for the following chars, to justify the expense. So, we need to narrow down the list to just what would be really be minimally necessary....


--------------

Here is what I've gathered together for translation so far:


We need to have the following words recorded, so that SayAlpha will be
able to handle all the characters in the ASCII set.

----------
Already available:
----------
We already have these chars available in the digits/ and letters/
directories:

char (ascii code) "phrase(dir)"

(20) "space(letters)"
!(21) "exclamation point(letters)"
#(23) "pound(digits)"
$(24) "dollar(letters)"
*(2A) "star(digits)"
+(2B) "plus(letters)"
-(2D) "minus(digits)" "dash(letters)"
.(2E) "dot(letters)"
/(2F) "slash(letters)"
=(3D) "equals(letters)"
@(40) "at(digits+letters)"

Since the above are already recorded in all 3 languages, we do not need
to repeat this work! I'm showing them here to calm the worries of anyone
who thinks we are missing something!



----------
Need to add:
----------

ENGLISH

"(22) "double quote"
#(23) "hash"               (this one is optional for (es,fr), as
"pound"
                           is already recorded in all 3 languages.)
%(25) "percent"
&(26) "ampersand"
'(27) "single quote"
((28) "left parenthesis"
)(29) "right parenthesis"
*(2A) "Asterisk"
,(2C) "comma"
:(3A) "colon"
;(3B) "semicolon"
<(3C) "less-than"
>(3E) "greater-than"
?(3F) "question mark"
[(5B) "left square brace"
\(5C) "back-slash"
](5D) "right square brace"
^(5E) "caret"
_(5F) "underscore"
`(60) "back-quote"
{(7B) "left curly brace"
|(7C) "pipe"
}(7D) "right curly brace"
~(7E) "tilde"
(7F) "DEL"
     "control"  (a prefix for var. chars to give "Control A" for
                 ascii (01), etc.)
     "capital"  (a prefix before a letter, to differentiate A from a.)
     "small"    (a prefix before a letter, to differentiate a from A.)

Also, for es, fr, if there are regionally popular names for any of these
character names, as, for example, "hash", and "pound", for US/GB, you
may wish to add these multiple translations, and we can offer the
variants.

Also, if we can develop a short list of prefixes/suffixes for the
popular 8859-1 characters used in (es,fr), (for example "umlaut", if the
language were German), we can pronounce these characters also in (es,
fr), beyond the ascii 1-127 set...

The following characters will allow the rest of the 8859-1 characters to
be pronounced in English; some variations may be necessary for Spanish
and French! To hold down the cost, perhaps we can eliminate the
characters that would not be used in French or Spanish!


     "inverted"  (as in: inverted exclamation point, inverted question
mark)
     "sign"      (as in: cent sign, not sign)
     "cent"      (as in: cent sign)
     "currency"  (as in: currency sign)
     "yen"        (as in: yen sign)
     "broken bar"  
     "section"    (as in section sign)
     "diaeresis"
     "copyright"  (as in copyright sign)
     "feminine"    (as in feminine ordinal indicator)
     "masculine"   (as in masculine ordinal indicator)
     "ordinal indicator"
     "left double-angle"
     "right double-angle"
     "not"          (as in: not sign)
     "soft hyphen"
     "registered"   (as in: registered sign)
     "macron"
     "degree"        (as in: degree sign)
     "plus-minus"    (as in: plus-minus sign)
     "superscript"   (as in: superscript two, superscript three)
     "acute accent"
     "micro"         (as in: micro sign, or the greek letter mu)
     "pilcrow"       (as in: pilcrow sign)
     "middle"        (as in: middle dot)
     "cedilla"
     "quarter"
     "half"
     "three-quarters"
     "with"          (as in: Capital A with grave)
     "grave"         (as in: Capital A with grave)
     "acute"         (as in: Capital A with acute)
     "circumflex"    (as in: Capital A with circumflex)
     "ring above"    (as in: Capital A with ring above)
     "Latin Letter AE"  (as in: Capital Latin Letter AE)
     "Latin Letter Eth" (as in: Capital Latin Letter Eth)
     "multiplication" (as in: multiplication sign)
     "division"       (as in: division sign)
     "stroke"         (as in: SMALL LETTER O WITH STROKE)
     "thorn"          (as in: Latin Letter Thorn)
     "sharp"          (as in: LATIN SMALL LETTER SHARP S)

  (for reference, I provide a table of the characters in 8859-1 below)


      Oct   Dec   Hex   Char   Description

--------------------------------------------------------------------
      240   160   A0           NO-BREAK SPACE
      241   161   A1     ¡     INVERTED EXCLAMATION MARK
      242   162   A2     ¢     CENT SIGN
      243   163   A3     £     POUND SIGN
      244   164   A4     ¤     CURRENCY SIGN
      245   165   A5     ¥     YEN SIGN
      246   166   A6     ¦     BROKEN BAR
      247   167   A7     §     SECTION SIGN
      250   168   A8     ¨     DIAERESIS
      251   169   A9     ©     COPYRIGHT SIGN
      252   170   AA     ª     FEMININE ORDINAL INDICATOR
      253   171   AB     «     LEFT-POINTING DOUBLE ANGLE QUOTATION
MARK
      254   172   AC     ¬     NOT SIGN
      255   173   AD     ­     SOFT HYPHEN
      256   174   AE     ®     REGISTERED SIGN
      257   175   AF     ¯     MACRON
      260   176   B0     °     DEGREE SIGN
      261   177   B1     ±     PLUS-MINUS SIGN
      262   178   B2     ²     SUPERSCRIPT TWO
      263   179   B3     ³     SUPERSCRIPT THREE
      264   180   B4     ´     ACUTE ACCENT
      265   181   B5     µ     MICRO SIGN
      266   182   B6     ¶     PILCROW SIGN
      267   183   B7     ·     MIDDLE DOT
      270   184   B8     ¸     CEDILLA
      271   185   B9     ¹     SUPERSCRIPT ONE
      272   186   BA     º     MASCULINE ORDINAL INDICATOR
      273   187   BB     »     RIGHT-POINTING DOUBLE ANGLE QUOTATION
MARK
      274   188   BC     ¼     VULGAR FRACTION ONE QUARTER
      275   189   BD     ½     VULGAR FRACTION ONE HALF
      276   190   BE     ¾     VULGAR FRACTION THREE QUARTERS
      277   191   BF     ¿     INVERTED QUESTION MARK
      300   192   C0     À     LATIN CAPITAL LETTER A WITH GRAVE
      301   193   C1     Á     LATIN CAPITAL LETTER A WITH ACUTE
      302   194   C2     Â     LATIN CAPITAL LETTER A WITH CIRCUMFLEX
      303   195   C3     Ã     LATIN CAPITAL LETTER A WITH TILDE
      304   196   C4     Ä     LATIN CAPITAL LETTER A WITH DIAERESIS
      305   197   C5     Å     LATIN CAPITAL LETTER A WITH RING ABOVE
      306   198   C6     Æ     LATIN CAPITAL LETTER AE
      307   199   C7     Ç     LATIN CAPITAL LETTER C WITH CEDILLA
      310   200   C8     È     LATIN CAPITAL LETTER E WITH GRAVE
      311   201   C9     É     LATIN CAPITAL LETTER E WITH ACUTE
      312   202   CA     Ê     LATIN CAPITAL LETTER E WITH CIRCUMFLEX
      313   203   CB     Ë     LATIN CAPITAL LETTER E WITH DIAERESIS
      314   204   CC     Ì     LATIN CAPITAL LETTER I WITH GRAVE
      315   205   CD     Í     LATIN CAPITAL LETTER I WITH ACUTE
      316   206   CE     Î     LATIN CAPITAL LETTER I WITH CIRCUMFLEX
      317   207   CF     Ï     LATIN CAPITAL LETTER I WITH DIAERESIS
      320   208   D0     Ð     LATIN CAPITAL LETTER ETH
      321   209   D1     Ñ     LATIN CAPITAL LETTER N WITH TILDE
      322   210   D2     Ò     LATIN CAPITAL LETTER O WITH GRAVE
      323   211   D3     Ó     LATIN CAPITAL LETTER O WITH ACUTE
      324   212   D4     Ô     LATIN CAPITAL LETTER O WITH CIRCUMFLEX
      325   213   D5     Õ     LATIN CAPITAL LETTER O WITH TILDE
      326   214   D6     Ö     LATIN CAPITAL LETTER O WITH DIAERESIS
      327   215   D7     ×     MULTIPLICATION SIGN
      330   216   D8     Ø     LATIN CAPITAL LETTER O WITH STROKE
      331   217   D9     Ù     LATIN CAPITAL LETTER U WITH GRAVE
      332   218   DA     Ú     LATIN CAPITAL LETTER U WITH ACUTE
      333   219   DB     Û     LATIN CAPITAL LETTER U WITH CIRCUMFLEX
      334   220   DC     Ü     LATIN CAPITAL LETTER U WITH DIAERESIS
      335   221   DD     Ý     LATIN CAPITAL LETTER Y WITH ACUTE
      336   222   DE     Þ     LATIN CAPITAL LETTER THORN
      337   223   DF     ß     LATIN SMALL LETTER SHARP S
      340   224   E0     à     LATIN SMALL LETTER A WITH GRAVE
      341   225   E1     á     LATIN SMALL LETTER A WITH ACUTE
      342   226   E2     â     LATIN SMALL LETTER A WITH CIRCUMFLEX
      343   227   E3     ã     LATIN SMALL LETTER A WITH TILDE
      344   228   E4     ä     LATIN SMALL LETTER A WITH DIAERESIS
      345   229   E5     å     LATIN SMALL LETTER A WITH RING ABOVE
      346   230   E6     æ     LATIN SMALL LETTER AE
      347   231   E7     ç     LATIN SMALL LETTER C WITH CEDILLA
      350   232   E8     è     LATIN SMALL LETTER E WITH GRAVE
      351   233   E9     é     LATIN SMALL LETTER E WITH ACUTE
      352   234   EA     ê     LATIN SMALL LETTER E WITH CIRCUMFLEX
      353   235   EB     ë     LATIN SMALL LETTER E WITH DIAERESIS
      354   236   EC     ì     LATIN SMALL LETTER I WITH GRAVE
      355   237   ED     í     LATIN SMALL LETTER I WITH ACUTE
      356   238   EE     î     LATIN SMALL LETTER I WITH CIRCUMFLEX
      357   239   EF     ï     LATIN SMALL LETTER I WITH DIAERESIS
      360   240   F0     ð     LATIN SMALL LETTER ETH
      361   241   F1     ñ     LATIN SMALL LETTER N WITH TILDE
      362   242   F2     ò     LATIN SMALL LETTER O WITH GRAVE
      363   243   F3     ó     LATIN SMALL LETTER O WITH ACUTE
      364   244   F4     ô     LATIN SMALL LETTER O WITH CIRCUMFLEX
      365   245   F5     õ     LATIN SMALL LETTER O WITH TILDE
      366   246   F6     ö     LATIN SMALL LETTER O WITH DIAERESIS
      367   247   F7     ÷     DIVISION SIGN
      370   248   F8     ø     LATIN SMALL LETTER O WITH STROKE
      371   249   F9     ù     LATIN SMALL LETTER U WITH GRAVE
      372   250   FA     ú     LATIN SMALL LETTER U WITH ACUTE
      373   251   FB     û     LATIN SMALL LETTER U WITH CIRCUMFLEX
      374   252   FC     ü     LATIN SMALL LETTER U WITH DIAERESIS
      375   253   FD     ý     LATIN SMALL LETTER Y WITH ACUTE
      376   254   FE     þ     LATIN SMALL LETTER THORN
      377   255   FF     ÿ     LATIN SMALL LETTER Y WITH DIAERESIS

In the above table, I plan to shorten "Latin Small Letter N with Tilde"
to "N with Tilde" in English, and supply a couple of options that would
yield "small N with Tilde" or "capital N with tilde", and so on.

If/when we move to unicode, we can/will use the same pronunciations,
just provide a new encoding to pronounce them...

By: Digium Subversion (svnbot) 2007-11-20 17:26:38.000-0600

Repository: asterisk
Revision: 89467

U   trunk/main/say.c

------------------------------------------------------------------------
r89467 | tilghman | 2007-11-20 17:26:37 -0600 (Tue, 20 Nov 2007) | 3 lines

Add support for new recorded character sounds
Closes issue ASTERISK-5070

------------------------------------------------------------------------