What now G.722? SILK Speech Codec to rule them all?
Some good news from the read worthy Skype Journal: Skype SILK codec in the IETF standards process.
This IS major news. As Phil Wolff points out, one of the three obstacles for Skype is now being solved. In my opinion, the codec issue has been one of the most difficult for Skype - along with the Joltid issues.
From my perspective, one of the good thing about the SILK Speech Codec being put through an ITEF track is that the codec is now becoming open and usable for more than Skype users.
This could mean that in a not so distant future we will have SILK for Asterisk, making Skype For Asterisk much more interesting.
What really is worth taking a small peek at, is how this will effect the "other" Wideband Audio codec, namely G.722; But also the Wideband Audio eco system as a whole.
In what way could, or will, SILK compete with G.722? In one of the VoIP Users Conference calls someone said that G.722 is reasonable easy to implement compared to certain other codecs - but let's not forget that codec implementation is done once per unit/project. As far as I understand the SILK home page - there will be no source code available, but the SILK SDK is easily available. Given that the G.722 patent has expired, it's use should be comparable to the relaxed license Skype have in mind for SILK. What will become interesting if there will be any 3rd party implementations of SILK. This of course will depend on how many architectures the SDK will be available to; Currently x86 Windows or ARM Linux.
What about the eco system around Wideband Audio - will the eco system as a whole care? My guess is only hard core VoIP people will care - the rest of the eco system will be more concerned about the applications. However, what could happen when SILK becomes an IETF recommendation, is that we will end up with two islands of Wideband Audio: SILK and G.722. I really do hope this will not happen, we are on the verge on general deployment of Wideband Audio. We, as VoIP professionals, can simply not afford a Wideband Audio "split".
I do believe that for quite some time we will find SILK only in non-Skype software, but not in the major hardware vendors phone offerings. My guess is that SILK will first be supported by smaller hardware vendors (i.e. "Designed & Made in China). Of the more well known phone producers, Snom will probably one of the first.
A short background into SILK
The codec is developed by the Skype Audio team. The bitrate varies from 6 to 40 kilobit/second. Comparably, G.722 operate at 48, 56 and 64 kilobit/second.
Given that everything boils down to the bandwidth it's worth noting that SILK operates on 8, 16 and 24 kHz - this is higher than G.722. We should also note that SILK reaches a Mean Opinion Score (G.107) of 4.49 faster than other Wideband codecs (i.e. lower kilobit/second) - actually, some of the other Wideband codecs will not reach MOS=4.49. These numbers are taken from official SILK literature.
How about packet loss? When we loose packets, the MOS falls. What I have seen so far is that if the packet loss reaches 10%, the MOS is still much higher than comparable Wideband codecs.
Concluding remark
We should embrace SILK even if it's "made by Skype".
Skype is not "evil" and "closed" any more. Skype is opening up (Skype for Asterisk / Skype for Sip) - so releasing their codec into the wild is just the next logical step.
6 comments
Thanks for the comment.
You got me there, and found a big typo. Thank you very much for spotting this.
What I really wanted to write is that there will be no source code available.
I really do not see Skype going open source with the SILK codec pretty soon, but what I really hope for is that given the IETF track the codec is put through, maybe - just maybe, a 3rd party will do an implementation not far into the future.
Actually I have hopes for a 3rd party doing an implementation: The reason is that if a protocol to become an Internet standard there must be
1) at least two independent and interoperable implementations from different code bases (elevates to becoming a "Draft Standard"),
2) a high degree of technical maturity and by a generally held belief that the specified protocol or service provides significant benefit to the Internet community (become a "Internet Standard").
The current IETF document describing SILK has, as far as I understand, not yet reached the first IETF standards track maturity level ("Proposed Standard"). This maturity level states: A Proposed Standard specification is generally stable, has resolved known design choices, is believed to be well-understood, has received significant community review, and appears to enjoy enough community interest to be considered valuable. SILK has yet a long way to go before calling it self a Internet Standard.
SILK starts out back of the pack in that there are no devices supporting it. Hardware device support is crucial. That alone is what drives G.722, despite its lack of technical sophistication.
We already have several royalty-free standard wideband codecs in G.722, G722.1 & G.722.1C. Royalty free is not enough on it's own.
I completely agree with you that if you have a lossy compression scheme, MOS=5 can not be achieved. My bad for not being clearer.
What I probably meant to say is NOT that SILK as the time being is able to perform MOS=5 - but will probably reach MOS=5 in the future - faster than other wide band codecs.
What is true today, based on published data from Skype, is that SILK reaches MOS=4.49 faster than other wide band codecs. I have now amended the article to better suit the current state of things.
Thank you for pointing this error out to me, much appreciated.
https://developer.skype.com/silk
ietf draft:
http://tools.ietf.org/html/draft-vos-silk-01
This post has 1 feedback awaiting moderation...

22/09/09 08:36:29 pm,