Punycode Encoding / Decoding
See more Encryption Examples
Punycode is an encoding standard for representing Unicode characters using only the 7bit us-ascii characters that are permitted in network host names. Punycode is used for internationalized domain names -- i.e. IDN or IDNA (Internationalizing Domain Names in Applications).Punycode is defined in RFC 3492. Converting to/from punycode does not include the "xn--" prefix. The "xn--" prefix is to signify that punycode follows. For example, the string " café.com" is converted to "caf-dma.com" in punycode. The punycode domain name is "xn--caf-dma.com".
Converting an email address to punycode would be as follows. Suppose the email address is "coffee@café.com". The punycode representation is "coffee@xn--caf-dma.com". The RFC 3492 punycode representation of "café.com" is simply "caf-dma.com", but the punycode domain name is "xn--caf-dma.com".
The "xn--" is a constant. It is the same regardless of the domain. For example, the punycode URL representation of "mañana.com" is "xn--maana-pta.com".
Chilkat Unicode C Downloads
#include <C_CkStringBuilderW.h>
void ChilkatSample(void)
{
BOOL success;
HCkStringBuilderW sb;
success = FALSE;
sb = CkStringBuilderW_Create();
// Load the string "café" from a utf-8 text file.
success = CkStringBuilderW_LoadFile(sb,L"qa_data/txt/cafe.txt",L"utf-8");
CkStringBuilderW_PunyEncode(sb);
wprintf(L"%s\n",CkStringBuilderW_getAsString(sb));
CkStringBuilderW_PunyDecode(sb);
wprintf(L"%s\n",CkStringBuilderW_getAsString(sb));
// The output is:
//
// caf-dma
// café
CkStringBuilderW_Dispose(sb);
}