From gora at sarai.net Sat May 2 11:28:27 2009
From: gora at sarai.net (Gora Mohanty)
Date: Sat, 2 May 2009 23:58:27 +0530
Subject: [Padma] How to handle proprietary fonts encoded in utf-8
In-Reply-To: <141c154a0904271118q5278869ajf8c72ecd30f5fa52@mail.gmail.com>
References: <8426eb510904270807s66e84c07k47ac3c90f4ea3b25@mail.gmail.com>
 <141c154a0904270836j6a79738bk950edd9304482597@mail.gmail.com>
 <20090427231936.04cfd333@mail.sarai.net>
 <141c154a0904271118q5278869ajf8c72ecd30f5fa52@mail.gmail.com>
Message-ID: <20090502235827.413498fd@mail.sarai.net>

On Mon, 27 Apr 2009 15:18:14 -0300
Golam Mortuza Hossain wrote:
[...]
> Unfortunately, that is precisely what these sites do. Here is an example:
>
> http://www.amardeshbd.com
>
> They basically assign every Bengali conjunct a separate codepoint,
> which is in violation of the Unicode standard. I may be wrong in saying
> that those characters are exactly Chinese, but from the appearance
> of the glyphs, they look similar.
[...]

I finally ran into such an example on a page displayed by Google
News. At least in that case, it turns out to be a character from
the Unicode private use area. To the best of my knowledge, there is
nothing that can be done to fix this in a generic way. The Unicode
FAQ entry on the private use area specifically disclaims any
responsibility: http://unicode.org/faq/casemap_charprop.html#8

So, the best solution is probably what you suggested earlier, i.e.,
if you know what the character is supposed to be, special-case it
for Padma.
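A minimal sketch of that special-casing, written in Python for illustration rather than as Padma code: it flags code points whose Unicode general category is `Co` (private use) and replaces only those listed in an override table. The name `PUA_OVERRIDES` and its single entry are hypothetical; a real table would have to be built by looking at what the site's font actually draws for each code point.

```python
import unicodedata

# Hypothetical special-case table: private-use code point -> standard Unicode.
# The one entry below is an invented placeholder, not any real site's mapping.
PUA_OVERRIDES = {
    "\ue000": "\u0995\u09cd\u09b7",  # assumed: glyph for Bengali KA + virama + SSA
}


def is_private_use(ch):
    """True if the character's general category is Co (private use)."""
    return unicodedata.category(ch) == "Co"


def find_private_use(text):
    """List (index, 'U+XXXX') for every private-use character in text."""
    return [(i, "U+%04X" % ord(ch)) for i, ch in enumerate(text) if is_private_use(ch)]


def replace_known(text, overrides=PUA_OVERRIDES):
    """Replace only the private-use characters that have a known override."""
    return "".join(overrides.get(ch, ch) for ch in text)
```

Private-use characters without an entry are left untouched, so `find_private_use` can still report them for later additions to the table.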
Regards,
Gora

From gora at sarai.net Sat May 2 11:37:58 2009
From: gora at sarai.net (Gora Mohanty)
Date: Sun, 3 May 2009 00:07:58 +0530
Subject: [Padma] How to handle proprietary fonts encoded in utf-8
In-Reply-To: <8426eb510904281023v3b034020u3d062e70d216706c@mail.gmail.com>
References: <8426eb510904270807s66e84c07k47ac3c90f4ea3b25@mail.gmail.com>
 <20090427231935.7249204b@mail.sarai.net>
 <8426eb510904281023v3b034020u3d062e70d216706c@mail.gmail.com>
Message-ID: <20090503000758.248d18a1@mail.sarai.net>

On Tue, 28 Apr 2009 22:53:57 +0530
arjuna rao chavala wrote:

> No need to be aware of language. Just have a glance at www.eenadu.net with
> Padma and www.sakshi.com with proprietary font loaded. You will see
> extraneous characters (double dagger with www.sakshi.com).
[...]

Neither of these pages seem to be UTF-8 encoded.
http://www.eenadu.net/ has charset=windows-1252,
and charset=ISO-8859-1, and my Firefox3 recognises
it as windows-1252. Of course, it does not have
the proper font, so cannot show the characters
correctly.

http://www.sakshi.com/ does not specify a charset,
so my Firefox defaults to UTF-8, but that is obviously
wrong. Downloading the font, SW908.TTF, shows Telugu
characters in places reserved for ASCII, and nothing in
the places reserved for Telugu, so it is obviously
non-Unicode. The portions that look OK seem to be images.
However, one should be able to map this font for
Padma.
Regards,
Gora

From arjunaraoc at googlemail.com Sun May 3 06:57:08 2009
From: arjunaraoc at googlemail.com (arjuna rao chavala)
Date: Sun, 3 May 2009 19:27:08 +0530
Subject: [Padma] How to handle proprietary fonts encoded in utf-8
In-Reply-To: <20090503000758.248d18a1@mail.sarai.net>
References: <8426eb510904270807s66e84c07k47ac3c90f4ea3b25@mail.gmail.com>
 <20090427231935.7249204b@mail.sarai.net>
 <8426eb510904281023v3b034020u3d062e70d216706c@mail.gmail.com>
 <20090503000758.248d18a1@mail.sarai.net>
Message-ID: <8426eb510905030657p2b0b1aeeo719d982b069b9170@mail.gmail.com>

On Sun, May 3, 2009 at 12:07 AM, Gora Mohanty wrote:

> On Tue, 28 Apr 2009 22:53:57 +0530
> arjuna rao chavala wrote:
>
> > No need to be aware of language. Just have a glance at www.eenadu.net with
> > Padma and www.sakshi.com with proprietary font loaded. You will see
> > extraneous characters (double dagger with www.sakshi.com).
> [...]
>
> Neither of these pages seem to be UTF-8 encoded.
> http://www.eenadu.net/ has charset=windows-1252,
> and charset=ISO-8859-1, and my Firefox3 recognises
> it as windows-1252. Of course, it does not have
> the proper font, so cannot show the characters
> correctly.

My point was not about eenadu.

> http://www.sakshi.com/ does not specify a charset,
> so my Firefox defaults to UTF-8, but that is obviously
> wrong. Downloading the font, SW908.TTF, shows Telugu
> characters in places reserved for ASCII, and nothing in
> the places reserved for Telugu, so it is obviously
> non-Unicode. The portions that look OK seem to be images.
> However, one should be able to map this font for
> Padma.

Please select non utf-8 as encoding and see that Sakshi does not display
telugu properly in many of the places.
>
> Regards,
> Gora
> _______________________________________________
> Padma mailing list
> Padma at mozdev.org
> https://www.mozdev.org/mailman/listinfo/padma
>

From gora at sarai.net Sun May 3 11:34:55 2009
From: gora at sarai.net (Gora Mohanty)
Date: Mon, 4 May 2009 00:04:55 +0530
Subject: [Padma] How to handle proprietary fonts encoded in utf-8
In-Reply-To: <8426eb510905030657p2b0b1aeeo719d982b069b9170@mail.gmail.com>
References: <8426eb510904270807s66e84c07k47ac3c90f4ea3b25@mail.gmail.com>
 <20090427231935.7249204b@mail.sarai.net>
 <8426eb510904281023v3b034020u3d062e70d216706c@mail.gmail.com>
 <20090503000758.248d18a1@mail.sarai.net>
 <8426eb510905030657p2b0b1aeeo719d982b069b9170@mail.gmail.com>
Message-ID: <20090504000455.5b39b60e@mail.sarai.net>

On Sun, 3 May 2009 19:27:08 +0530
arjuna rao chavala wrote:

> On Sun, May 3, 2009 at 12:07 AM, Gora Mohanty wrote:
> My point was not about eenadu

Um, OK.

> > http://www.sakshi.com/ does not specify a charset,
> > so my Firefox defaults to UTF-8, but that is obviously
> > wrong. Downloading the font, SW908.TTF, shows Telugu
> > characters in places reserved for ASCII, and nothing in
> > the places reserved for Telugu, so it is obviously
> > non-Unicode. The portions that look OK seem to be images.
> > However, one should be able to map this font for
> > Padma.
> >
> Please select non utf-8 as encoding and see that Sakshi does not display
> telugu properly in many of the places.
[...]

That is the point. There is no standardised encoding that
the site seems to follow, so (without Padma) the only way
to get a standards-compliant browser to render it properly
is to download their font, with the proprietary encoding,
and always use that for this site.

However, this is exactly the problem that Padma was designed
to solve. If you make appropriate tables for Padma, using the
SW908.TTF font used on the site, Padma will automagically
convert it to UTF-8 for you.
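The kind of table meant here can be sketched roughly as follows. This is Python for illustration only, not Padma's actual table format, and every entry in `FONT_TO_UNICODE` is an invented placeholder: the real pairs have to be read off SW908.TTF itself. The sketch does a longest-match replacement of the font's ASCII-slot codes with Unicode Telugu and leaves anything unmapped unchanged.

```python
# Hypothetical table: codes in the font's ASCII slots -> Unicode Telugu.
# These pairs are invented for illustration and are NOT the SW908 layout.
FONT_TO_UNICODE = {
    "A": "\u0c05",               # assumed: slot 'A' draws TELUGU LETTER A
    "B": "\u0c06",               # assumed: slot 'B' draws TELUGU LETTER AA
    "k": "\u0c15",               # assumed: slot 'k' draws TELUGU LETTER KA
    "k+": "\u0c15\u0c4d\u0c15",  # assumed two-code sequence: KA + virama + KA
}


def convert(text, table=FONT_TO_UNICODE):
    """Convert font-encoded text to Unicode using longest-match lookup."""
    # Try longer keys first so "k+" wins over "k".
    keys = sorted(table, key=len, reverse=True)
    out = []
    i = 0
    while i < len(text):
        for key in keys:
            if text.startswith(key, i):
                out.append(table[key])
                i += len(key)
                break
        else:
            # No table entry starts here: pass the character through.
            out.append(text[i])
            i += 1
    return "".join(out)
```

A real converter for an Indic font may also have to reorder some signs after the lookup; this sketch does not attempt that.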
Regards,
Gora

From haridallas at gmail.com Sat May 16 20:27:37 2009
From: haridallas at gmail.com (hariharan Ramamurthy)
Date: Sat, 16 May 2009 22:27:37 -0500
Subject: [Padma] please give support for sakshi
Message-ID: <46D68105766B48449EE20F7815424BBB@hari1956>

Sakshi is one of the upcoming and major online papers; its font is SW908.
Please support RTS conversion for this website.

This is useful for a research project I am doing.

Thanks

Dr. Hariharan
haridallas at gmail.com

From vsrawat at gmail.com Sat May 16 20:37:28 2009
From: vsrawat at gmail.com (V S Rawat)
Date: Sun, 17 May 2009 09:07:28 +0530
Subject: [Padma] please give support for sakshi
In-Reply-To: <46D68105766B48449EE20F7815424BBB@hari1956>
References: <46D68105766B48449EE20F7815424BBB@hari1956>
Message-ID: <4A0F8678.70501@gmail.com>

Could you give a link to the website, or say where information on this
project is available, or where the Sakshi font can be downloaded along
with some text in it?

Thanks.
--
Rawat

On 5/17/2009 8:57 AM India Time, _hariharan Ramamurthy_ wrote:

> Sakshi is one of the upcoming and major online papers; its font is SW908.
> Please support RTS conversion for this website.
>
> This is useful for a research project I am doing.
>
> Thanks
> Dr. Hariharan
> haridallas at gmail.com

From arjunaraoc at googlemail.com Mon May 18 10:54:54 2009
From: arjunaraoc at googlemail.com (arjuna rao chavala)
Date: Mon, 18 May 2009 23:24:54 +0530
Subject: [Padma] please give support for sakshi
In-Reply-To: <46D68105766B48449EE20F7815424BBB@hari1956>
References: <46D68105766B48449EE20F7815424BBB@hari1956>
Message-ID: <8426eb510905181054t5c8a5ed1mf66a103e9ff28559@mail.gmail.com>

I have attempted and shared the difficulties in a previous post on this
list.
As of now, Linux users are able to read Sakshi (without Padma) after
installing the font, though there were some unnecessary symbols in
between the Telugu text.

Regards
Arjun

On Sun, May 17, 2009 at 8:57 AM, hariharan Ramamurthy wrote:

> Sakshi is one of the upcoming and major online papers; its font is SW908.
> Please support RTS conversion for this website.
>
> This is useful for a research project I am doing.
>
> Thanks
>
> Dr. Hariharan
>
> haridallas at gmail.com

From harshita at atc.tcs.com Tue May 19 00:47:34 2009
From: harshita at atc.tcs.com (harshita at atc.tcs.com)
Date: Tue, 19 May 2009 13:17:34 +0530 (IST)
Subject: [Padma] please give support for sakshi
In-Reply-To: <8426eb510905181054t5c8a5ed1mf66a103e9ff28559@mail.gmail.com>
References: <46D68105766B48449EE20F7815424BBB@hari1956>
 <8426eb510905181054t5c8a5ed1mf66a103e9ff28559@mail.gmail.com>
Message-ID:

Hi Arjun,

In one of your previous mails you mentioned enclosing a .js file, but
I couldn't find it. Can you please send it again?

regards,
harshita

> I have attempted and shared the difficulties in a previous post on this
> list.
> As of now, Linux users are able to read Sakshi (without Padma) after
> installing the font, though there were some unnecessary symbols in
> between the Telugu text.
>
> Regards
> Arjun
> On Sun, May 17, 2009 at 8:57 AM, hariharan Ramamurthy
> wrote:
>
>> Sakshi is one of the upcoming and major online papers; its font is SW908.
>> Please support RTS conversion for this website.
>>
>> This is useful for a research project I am doing.
>>
>> Thanks
>>
>> Dr. Hariharan
>>
>> haridallas at gmail.com

From vsrawat at gmail.com Sat May 16 04:22:52 2009
From: vsrawat at gmail.com (V S Rawat)
Date: Sat, 16 May 2009 16:52:52 +0530
Subject: [Padma] Any conversion program to convert Arjun and Kalidas font to Unicode?
Message-ID: <4A0EA20C.1020109@gmail.com>

Could anyone suggest any conversion program to convert Arjun-wide font
and Kalidas font to Unicode?

I myself have one to convert from arjun to unicode but this font is
arjun-wd so I guess its character map would be different.

Thanks.
--
rawat

From arjunaraoc at googlemail.com Wed May 20 09:16:36 2009
From: arjunaraoc at googlemail.com (arjuna rao chavala)
Date: Wed, 20 May 2009 21:46:36 +0530
Subject: [Padma] please give support for sakshi
In-Reply-To:
References: <46D68105766B48449EE20F7815424BBB@hari1956>
 <8426eb510905181054t5c8a5ed1mf66a103e9ff28559@mail.gmail.com>
Message-ID: <8426eb510905200916r14a4be7t6252b76a415f02ee@mail.gmail.com>

Sent privately to Harshita's email id.

Regards
Arjun

On Tue, May 19, 2009 at 1:17 PM, wrote:

> Hi Arjun,
>
> In one of your previous mails you mentioned enclosing a .js file, but
> I couldn't find it. Can you please send it again?
>
> regards,
> harshita
>
>
> > I have attempted and shared the difficulties in a previous post on this
> > list.
> > As of now, Linux users are able to read Sakshi (without Padma) after
> > installing the font, though there were some unnecessary symbols in
> > between the Telugu text.
> >
> > Regards
> > Arjun
> > On Sun, May 17, 2009 at 8:57 AM, hariharan Ramamurthy
> > wrote:
> >
> >> Sakshi is one of the upcoming and major online papers; its font is SW908.
> >> Please support RTS conversion for this website.
> >>
> >> This is useful for a research project I am doing.
> >>
> >> Thanks
> >>
> >> Dr. Hariharan
> >>
> >> haridallas at gmail.com

From harshita at atc.tcs.com Wed May 20 23:49:57 2009
From: harshita at atc.tcs.com (harshita at atc.tcs.com)
Date: Thu, 21 May 2009 12:19:57 +0530 (IST)
Subject: [Padma] Any conversion program to convert Arjun and Kalidas font to Unicode?
In-Reply-To: <4A0EA20C.1020109@gmail.com>
References: <4A0EA20C.1020109@gmail.com>
Message-ID:

Hi Rawat,

If you have .ttf files for both fonts, you can easily compare them for
similarity using the fontforge tool. You mentioned having a converter
for the arjun font. If it can be made a part of Padma (i.e., in the form
of a .js file in Padma), you can test the conversion by replacing the
font face with "arjun-wd" (in place of arjun). Going by the names arjun
and arjunwide, I guess the character maps might be similar. I have seen
font pairs such as ManjushaMedium and ManjushaBold, or priyaanka and
priyaankabold, which have this similarity.

regards,
harshita

> Could anyone suggest any conversion program to convert Arjun-wide font
> and Kalidas font to Unicode?
>
> I myself have one to convert from arjun to unicode but this font is
> arjun-wd so I guess its character map would be different.
>
> Thanks.
> --
> rawat