Summary: | DAHLIN-00126: [patch] Console full of PRI got event: HDLC Abort (6) on Primary D-channel of span 1 | ||
Reporter: | Alec Davis (alecdavis) | Labels: | |
Date Opened: | 2009-07-14 04:28:25 | Date Closed: | 2010-04-13 14:54:59 |
Priority: | Blocker | Regression? | No |
Status: | Closed/Complete | Components: | wcte12xp |
Versions: | 2.2.0-rc2 | Frequency of Occurrence | |
Related Issues: | |||
Environment: | Attachments: | ( 0) hdlc_Abort.txt ( 1) mantis-15498-1.patch ( 2) pri-intense-DAHDI-Version-SVN-trunk-r6005.txt | |
Description: | Since Feb we've been running the following version with no problem. DAHDI Version: SVN-trunk-r6005 Echo Canceller: After upgrade to DAHDI Version: 2.2.0 Echo Canceller: we get the following constantly on the console and hardly ever able to establish a call == Primary D-Channel on span 1 down [Jul 14 21:20:31] WARNING[1464]: chan_dahdi.c:3546 pri_find_dchan: No D-channels available! Using Primary channel 16 as D-channel anyway! == Primary D-Channel on span 1 up [Jul 14 21:20:31] NOTICE[1464]: chan_dahdi.c:11381 pri_dchannel: PRI got event: HDLC Abort (6) on Primary D-channel of span 1 [Jul 14 21:20:32] NOTICE[1464]: chan_dahdi.c:11381 pri_dchannel: PRI got event: HDLC Abort (6) on Primary D-channel of span 1 [Jul 14 21:20:33] NOTICE[1464]: chan_dahdi.c:11381 pri_dchannel: PRI got event: HDLC Abort (6) on Primary D-channel of span 1 [Jul 14 21:20:34] NOTICE[1464]: chan_dahdi.c:11381 pri_dchannel: PRI got event: HDLC Abort (6) on Primary D-channel of span 1 [Jul 14 21:20:35] NOTICE[1464]: chan_dahdi.c:11381 pri_dchannel: PRI got event: HDLC Abort (6) on Primary D-channel of span 1 == Primary D-Channel on span 1 down [Jul 14 21:20:36] WARNING[1464]: chan_dahdi.c:3546 pri_find_dchan: No D-channels available! Using Primary channel 16 as D-channel anyway! == Primary D-Channel on span 1 up [Jul 14 21:20:36] NOTICE[1464]: chan_dahdi.c:11381 pri_dchannel: PRI got event: HDLC Abort (6) on Primary D-channel of span 1 ****** ADDITIONAL INFORMATION ****** Tried DAHDI Version: 2.2.0-rc2 Echo Canceller: Same messages. Tried DAHDI Version: 2.2.0-rc1 Echo Canceller: Which is earlier than SVN-trunk-r6005 and no problem. /etc/dahdi/system.conf span=1,0,0,ccs,hdb3,crc4 bchan=1-15,17-31 # set this to 1-15,17-31 for E1 dchan=16 # set this to 16 for E1 /etc/asterisk/chan_dahdi.conf group = 0 pridialplan = unknown prilocaldialplan = unknown facilityenable=yes transfer=yes switchtype=qsig signalling=pri_cpe qsigchannelmapping=physical echocancel=yes echotraining=900 echocancelwhenbridged=no rxgain=0.0 txgain=0.0 channel => 1-15,17-31 | ||
Comments: | By: Alec Davis (alecdavis) 2009-07-14 04:57:54 uploaded hdlc_Abort.txt DAHDI Version: 2.2.0-rc2 Echo Canceller: which is a PRI call into asterisk voice mail with 'pri intense debug span 1' By: Alec Davis (alecdavis) 2009-07-14 05:11:26 uploaded pri-intense-DAHDI-Version-SVN-trunk-r6005.txt which is the same call sequence as previous note but into dahdi version r6005. By: Michael L. Young (elguero) 2009-07-14 11:26:28 Not sure if this helps but shouldn't your span be getting its timing from the network since I see you have signalling set as pri_cpe? span=1,1,0,ccs,hdb3,crc4 instead of span=1,0,0,ccs,hdb3,crc4 Just a quick thought. Also in pri-intense debug file I saw this: [Jul 14 22:00:17] WARNING[3828]: app_voicemail.c:8741 vm_authenticate: Couldn't read username By: Alec Davis (alecdavis) 2009-07-14 14:31:33 elguero: You maybe onto it. But even if it does fix it, I wonder why it's worked for so long, as no configurations changed just DAHDI. By: Alec Davis (alecdavis) 2009-07-14 14:45:46 Just tried span=1,1,0,ccs,hdb3,crc4 and same results. fails: (only tested) with DAHDI Version: 2.2.0-rc2 works: DAHDI-Version-SVN-trunk-r6005 elguero: the vm_authenticate error was becasue I hungup early. By: Alec Davis (alecdavis) 2009-07-15 05:02:12 r6527 is the beginning of the HDLC ABORT errors. In an attempt to find out at which patch release it breaks did the following; svn checkout http</u>://svn.digium.com/svn/dahdi/linux/trunk@6526 dahdi-linux-6526 and svn checkout http</u>://svn.digium.com/svn/dahdi/linux/trunk@6527 dahdi-linux-6527 and obviously a 'make' and 'make install' from appropriate dirs. 6526 doesn't have HDLC_ABORTS but does have [Jul 15 21:41:52] WARNING[13892]: chan_dahdi.c:2111 dahdi_enable_ec: Unable to enable echo c 6527 does have the HDLC_ABORTS 6527 has significant changes regarding voicebus voicebus: Move common vpmadt032 interface into voicebus module. By: Shaun Ruffell (sruffell) 2009-07-15 14:13:53 alecdavis: I've attached mantis-15489-1.patch which I believe will fix this. Could you give it a try in your environment? By: Alec Davis (alecdavis) 2009-07-15 16:36:00 sruffell: This to hopefully fix the HDLC ABORTS? Or does it fix the problem mentioned in the last note of "unable to enable echo canceller"? Will test after hours tonight, 10 hours away at least (NZST). Alec By: Shaun Ruffell (sruffell) 2009-07-15 16:42:17 alecdavis: The HDLC aborts. You should be able to apply that patch to 2.2.0.1 and be up and running, but I'm testing it some more myself before merging. By: Alec Davis (alecdavis) 2009-07-15 17:43:03 Are any other cards affected in the same way? We use the TE220/wct4xxp in 2 of our other sites? By: Shaun Ruffell (sruffell) 2009-07-15 17:47:48 No...this particular issue would only affect the wcte12xp based cards. By: Alec Davis (alecdavis) 2009-07-16 00:06:40 Managed to get mantis-15489-1.patch tested before end of day. Applied mantis-15489-1.patch to dahdi-linux-6527 result No HDLC Aborts, but far end could hardly hear me, static etc. Then applied to dahdi-2.2.0 tag release. result No HDLC Aborts, and good audio both ways. Alec By: Digium Subversion (svnbot) 2009-07-16 12:29:54 Repository: dahdi Revision: 6844 U linux/trunk/drivers/dahdi/wcte12xp/base.c ------------------------------------------------------------------------ r6844 | sruffell | 2009-07-16 12:29:54 -0500 (Thu, 16 Jul 2009) | 10 lines wcte12xp: Disable vpmadt032 companding by default. This fixes a regression in 2.2.0 where certain configurations will fail patloop test or have repeated HDLC aborts because the VPMADT032 is modifying the clear channel or d channel data streams. This restores the behavior to how it was in dahdi-linux 2.1.0.4. (closes issue DAHLIN-126) Reported by: alecdavis Tested by: alecdavis ------------------------------------------------------------------------ http://svn.digium.com/view/dahdi?view=rev&revision=6844 By: Digium Subversion (svnbot) 2009-07-21 13:11:57 Repository: dahdi Revision: 6864 _U linux/branches/2.2/ U linux/branches/2.2/drivers/dahdi/dahdi-base.c U linux/branches/2.2/drivers/dahdi/tor2.c U linux/branches/2.2/drivers/dahdi/wct4xxp/base.c U linux/branches/2.2/drivers/dahdi/wcte11xp.c U linux/branches/2.2/drivers/dahdi/wcte12xp/base.c U linux/branches/2.2/include/dahdi/dahdi_config.h ------------------------------------------------------------------------ r6864 | sruffell | 2009-07-21 13:11:56 -0500 (Tue, 21 Jul 2009) | 53 lines Merged revisions 6844,6852,6862-6863 via svnmerge from https://origsvn.digium.com/svn/dahdi/linux/trunk ........ r6844 | sruffell | 2009-07-16 12:29:53 -0500 (Thu, 16 Jul 2009) | 10 lines wcte12xp: Disable vpmadt032 companding by default. This fixes a regression in 2.2.0 where certain configurations will fail patloop test or have repeated HDLC aborts because the VPMADT032 is modifying the clear channel or d channel data streams. This restores the behavior to how it was in dahdi-linux 2.1.0.4. (closes issue DAHLIN-126) Reported by: alecdavis Tested by: alecdavis ........ r6852 | tzafrir | 2009-07-19 10:45:40 -0500 (Sun, 19 Jul 2009) | 12 lines tor2: allow using port4 as timing source Fix a silly regression introduced when strict check on the timing parameter was added (sync-1 is the array index, not sync itself. And 0 is a special case). (closes issue DAHLIN-120) Reported by: dferrer Patches: tor2-4th_sync.patch uploaded by dferrer (license 525) ........ r6862 | sruffell | 2009-07-21 12:52:59 -0500 (Tue, 21 Jul 2009) | 4 lines Revert "wct4xxp, wcte11xp: Use the default configuration by default at startup." This reverts the change introduced by revision 6712. This change can cause problems when there is a VPM module installed on the quad-span digital cards. ........ r6863 | sruffell | 2009-07-21 12:53:02 -0500 (Tue, 21 Jul 2009) | 12 lines dahdi-base: Add support for core timing. This essentially moves the function of dahdi_dummy into the core of DAHDI. It ensures that if DAHDI is loaded, it will always be able to provide timing, regardless of whether there are board drivers loaded, or if the board drivers are properly calling dahdi_receive. If there is a master span loaded which is calling dahdi_receive, then the behavior will be like it is normally. This functionality is off by default, uncomment CONFIG_DAHDI_CORE_TIMER in include/dahdi/config_dahdi.h in order to enable it. ........ ------------------------------------------------------------------------ http://svn.digium.com/view/dahdi?view=rev&revision=6864 By: viniciusfontes (viniciusfontes) 2009-11-10 10:31:36.000-0600 Problem still happens when using DAHDI-Linux 2.2.0.2 and DAHDI-Tools 2.2.0. The problem only happens when all these criteria are met: - E1 (ccs,hdb3) - ISDN protocol (I'm using signalling=euroisdn) - VPMADT032 is physically present When the VPMADT032 is physically removed the problem no longer happens. Actually I had to remove it in order to get a stable link with the telco. Also tried: echo 0 > /sys/module/wcte12xp/parameters/vpmsupport But the problem still happens. By: Shaun Ruffell (sruffell) 2009-11-10 10:35:30.000-0600 vinicusfontes: What happens when you add "options wcte12xp vpmsupport=0" to /etc/modprobe.d/dahdi and the VPMADT032 is physically present? (Unfortunately, the file permissions on the module parameters are wrong....echoing to the vpmsupport parameter after the fact will not disable the VPM. It should be read only). By: Shaun Ruffell (sruffell) 2009-11-10 10:35:48.000-0600 Also, could you try with revision 7550 of the 2.2 branch? By: viniciusfontes (viniciusfontes) 2009-11-10 10:37:04.000-0600 Good to know! I'll be able to test this in about four hours and I'll post the results here. By: Shaun Ruffell (sruffell) 2009-11-13 11:36:51.000-0600 viniciusfontes: any results? By: viniciusfontes (viniciusfontes) 2009-11-13 11:42:24.000-0600 I set up two boxes on my lab in order to reproduce the problem before getting the svn version. So far, over 100000 calls and no problem at all. Both systems are running DAHDI 2.2.0.2 and one of them has a VPMADT032. Maybe something went wrong when I updated that problematic system to 2.2.0.2 and the wcte12xp.ko didn't get replaced with the new version. dahdi_cfg -tv still reported DAHDI-Linux as 2.2.0.2 and DAHDI-tools as 2.2.0 thought. I recompiled DAHDI and Asterisk, used diff to compare the module at /usr/src and the one at /lib/modules, this time they are identical. Asked the customer to install the VPMADT032 again, and will report as soon as he do it. Looks like it was a false alarm (and for that I apologize in advance) but hey, better like this, isn't it? :) By: Shaun Ruffell (sruffell) 2009-11-13 11:51:44.000-0600 viniciusfontes: thanks for the update (and sorry about that last post twice, I don't know what happened there). Although, if you're running DAHDI 2.2.0.2, it may be possible to run into the sort of problems recently resolved as part of DAHLIN-135. Would you be able to update to the head of the 2.2 branch? We're going to make a 2.2.1-rc1 tag soon after we're able to run through a set of regression tests in our lab here. By: viniciusfontes (viniciusfontes) 2009-11-13 12:05:22.000-0600 I guess you're referring to DAHDI not detecting the 2100hz tone to disable all EC right? Unfortunely the E1 board running on my PBX is a TE110P, with no support for VPMADT032. I'm not sure if that issue happens even without a VPMADT032 attached, but if it does I would be glad to test it. By: Leif Madsen (lmadsen) 2010-04-13 13:50:11 I just noticed 2.3.0 will be going out soon, but this is a marked as a blocker for the next set of releases. Is this really a blocker or should the severity be knocked down? By: Shaun Ruffell (sruffell) 2010-04-13 14:54:58 I think this was resolved in 2.2.1. Reopen if I'm mistaken. Thanks. |