From arjunaraoc at googlemail.com Sun Apr 26 01:23:41 2009 From: arjunaraoc at googlemail.com (arjuna rao chavala) Date: Sun, 26 Apr 2009 13:53:41 +0530 Subject: [Padma] How to handle proprietary fonts encoded in utf-8 for ex: www.sakshi.com (Telugu newspaper) Message-ID: <8426eb510904260123w564151d9m19c16dfe75620197@mail.gmail.com> Hi, After experimenting for some time with Padma and browsing through the mail archives, I realized that if the page is encoded in unicode (utf-8), it is not converted by Padma. Please check the previous thread The one year old Sakshi Telugu news paper is using utf-8 with proprietary code points. Like Eenadu, extraneous characters (double dagger in case of Sakshi and Hyphen in case of Eenadu) appear, when the page is rendered with proprietary font. >From a font perspective, the proprietary font is 99% similar to AndhrayJyothy. Any tips on making the conversion dependent on URL rather than the encoding. This will also solve a problem with certain websites which do not work, when the encoding is not automatically determined by the browser. Thanks Arjun From vsrawat at gmail.com Sun Apr 26 01:29:55 2009 From: vsrawat at gmail.com (V S Rawat) Date: Sun, 26 Apr 2009 13:59:55 +0530 Subject: [Padma] Unicode to Kruti Dev 010 conversion method Message-ID: <49F41B83.6030204@gmail.com> How can I convert text written in Unicode font to Kruti Dev 010 font? Does Padma do it, in existing form or with some modifications, or any other free ready software available? Thanks. -- Rawat From gora at sarai.net Sun Apr 26 01:39:56 2009 From: gora at sarai.net (Gora Mohanty) Date: Sun, 26 Apr 2009 14:09:56 +0530 Subject: [Padma] How to handle proprietary fonts encoded in utf-8 for ex: www.sakshi.com (Telugu newspaper) In-Reply-To: <8426eb510904260123w564151d9m19c16dfe75620197@mail.gmail.com> References: <8426eb510904260123w564151d9m19c16dfe75620197@mail.gmail.com> Message-ID: <20090426140956.38602881@mail.sarai.net> On Sun, 26 Apr 2009 13:53:41 +0530 arjuna rao chavala wrote: [...] > The one year old Sakshi Telugu news paper is using utf-8 with proprietary > code points. Like Eenadu, extraneous characters (double dagger in case of > Sakshi and Hyphen in case of Eenadu) appear, when the page is rendered with > proprietary font. [...] This does not make sense to me, because if the encoding claimed to be UTF-8, but was really some proprietary encoding, no browser (nor any rendering engine that did not have that proprietary encoding specifically built in) would be able to render the text properly. Is it possible instead that their UTF-8 is somehow messed up? Do you have links to UTF-8 encoded pages that render properly in one browser, but not in another? Regards, Gora From gora at sarai.net Sun Apr 26 01:43:28 2009 From: gora at sarai.net (Gora Mohanty) Date: Sun, 26 Apr 2009 14:13:28 +0530 Subject: [Padma] Unicode to Kruti Dev 010 conversion method In-Reply-To: <49F41B83.6030204@gmail.com> References: <49F41B83.6030204@gmail.com> Message-ID: <20090426141328.16d9620f@mail.sarai.net> On Sun, 26 Apr 2009 13:59:55 +0530 V S Rawat wrote: > How can I convert text written in Unicode font to Kruti Dev 010 font? > > Does Padma do it, in existing form or with some modifications, or any > other free ready software available? Not sure about Padma, but you could try Sorry for the horrendously long link, but that is how they provided it. Have not looked at it in detail myself. Regards, Gora From vsrawat at gmail.com Sun Apr 26 01:57:55 2009 From: vsrawat at gmail.com (V S Rawat) Date: Sun, 26 Apr 2009 14:27:55 +0530 Subject: [Padma] Unicode to Kruti Dev 010 conversion method In-Reply-To: <20090426141328.16d9620f@mail.sarai.net> References: <49F41B83.6030204@gmail.com> <20090426141328.16d9620f@mail.sarai.net> Message-ID: <49F42213.1060801@gmail.com> On 4/26/2009 2:13 PM India Time, _Gora Mohanty_ wrote: > On Sun, 26 Apr 2009 13:59:55 +0530 > V S Rawat wrote: > >> How can I convert text written in Unicode font to Kruti Dev 010 font? >> >> Does Padma do it, in existing form or with some modifications, or any >> other free ready software available? > > Not sure about Padma, but you could try > > Sorry for the horrendously long link, but that is how they > provided it. > > Have not looked at it in detail myself. > > Regards, > Gora Thanks but the page is coming like ??? ? ?? C $ ? & G 5 f g f ?8 G ?/ B ( ?  K ! ?+ < K # M  ?* 0 ? 5 0 M $ ? $ M 0 ?(?1?3?-?1?-?2?0?0?8? ?1?6?:?3?3?)? ??? ? ? ? ? ? ? ? ?L?a?s?t? ?U?p?d?a?t?e? ?:? ?2?6?-?1?-?2?0?0?9? ?1?4?:?3?0? ? ? ? ? References: <49F41B83.6030204@gmail.com> <20090426141328.16d9620f@mail.sarai.net> <49F42213.1060801@gmail.com> Message-ID: <49F4228C.2020804@gmail.com> On 4/26/2009 2:27 PM India Time, _V S Rawat_ wrote: > On 4/26/2009 2:13 PM India Time, _Gora Mohanty_ wrote: > >> On Sun, 26 Apr 2009 13:59:55 +0530 >> V S Rawat wrote: >> >>> How can I convert text written in Unicode font to Kruti Dev 010 font? >>> >>> Does Padma do it, in existing form or with some modifications, or any >>> other free ready software available? >> Not sure about Padma, but you could try >> >> Sorry for the horrendously long link, but that is how they >> provided it. >> >> Have not looked at it in detail myself. >> >> Regards, >> Gora > > > Thanks but the page is coming like > > ??? ? ?? C $ ? & G 5 f g f ?8 G ?/ B ( ?  K ! ?+ > < K # M  ?* 0 ? 5 0 M $ ? $ M 0 ?(?1?3?-?1?-?2?0?0?8? ?1?6?:?3?3?)? > ??? ? ? ? ? ? ? ? ?L?a?s?t? ?U?p?d?a?t?e? ?:? > ?2?6?-?1?-?2?0?0?9? ?1?4?:?3?0? ? ? ? ? > in unicode as well as western page encoding. Even other pages are coming > all messed up even with all scripts allowed in ff3. > > how to read it correctly? > Sorry, seems I replied too early. When I opened that url in ie, that opened perfectly well and provided the online converter. Thanks for it. Let me try it. Any standalone? offline converter? or any method of using that page offline? -- Rawat > From vsrawat at gmail.com Sun Apr 26 02:04:05 2009 From: vsrawat at gmail.com (V S Rawat) Date: Sun, 26 Apr 2009 14:34:05 +0530 Subject: [Padma] Unicode to Kruti Dev 010 conversion method In-Reply-To: <49F4228C.2020804@gmail.com> References: <49F41B83.6030204@gmail.com> <20090426141328.16d9620f@mail.sarai.net> <49F42213.1060801@gmail.com> <49F4228C.2020804@gmail.com> Message-ID: <49F42385.8040208@gmail.com> On 4/26/2009 2:29 PM India Time, _V S Rawat_ wrote: > On 4/26/2009 2:27 PM India Time, _V S Rawat_ wrote: > >> On 4/26/2009 2:13 PM India Time, _Gora Mohanty_ wrote: >> >>> On Sun, 26 Apr 2009 13:59:55 +0530 >>> V S Rawat wrote: >>> >>>> How can I convert text written in Unicode font to Kruti Dev 010 font? >>>> >>>> Does Padma do it, in existing form or with some modifications, or any >>>> other free ready software available? >>> Not sure about Padma, but you could try >>> >>> Sorry for the horrendously long link, but that is how they >>> provided it. >>> >>> Have not looked at it in detail myself. >>> >>> Regards, >>> Gora >> >> Thanks but the page is coming like >> >> ??? ? ?? C $ ? & G 5 f g f ?8 G ?/ B ( ?  K ! ?+ >> < K # M  ?* 0 ? 5 0 M $ ? $ M 0 ?(?1?3?-?1?-?2?0?0?8? ?1?6?:?3?3?)? >> ??? ? ? ? ? ? ? ? ?L?a?s?t? ?U?p?d?a?t?e? ?:? >> ?2?6?-?1?-?2?0?0?9? ?1?4?:?3?0? ? ? ? ?> >> in unicode as well as western page encoding. Even other pages are coming >> all messed up even with all scripts allowed in ff3. >> >> how to read it correctly? >> > > Sorry, seems I replied too early. When I opened that url in ie, that > opened perfectly well and provided the online converter. > > Thanks for it. Let me try it. > > Any standalone? offline converter? or any method of using that page offline? Nope. that is converting one way from Kruti Dev 010 to Unicode. I want the other way conversion, from Unicode to Kruti Dev 010 thanks. -- Rawat From gora at sarai.net Sun Apr 26 05:39:17 2009 From: gora at sarai.net (Gora Mohanty) Date: Sun, 26 Apr 2009 18:09:17 +0530 Subject: [Padma] Unicode to Kruti Dev 010 conversion method In-Reply-To: <49F42385.8040208@gmail.com> References: <49F41B83.6030204@gmail.com> <20090426141328.16d9620f@mail.sarai.net> <49F42213.1060801@gmail.com> <49F4228C.2020804@gmail.com> <49F42385.8040208@gmail.com> Message-ID: <20090426180917.1cce5f56@mail.sarai.net> On Sun, 26 Apr 2009 14:34:05 +0530 V S Rawat wrote: [...] > >> Thanks but the page is coming like > >> > >> ??? ? ?? C $ ? & G 5 f g f ?8 G ?/ B ( ?  K ! ?+ > >> < K # M  ?* 0 ? 5 0 M $ ? $ M 0 ?(?1?3?-?1?-?2?0?0?8? ?1?6?:?3?3?)? > >> ??? ? ? ? ? ? ? ? ?L?a?s?t? ?U?p?d?a?t?e? ?:? > >> ?2?6?-?1?-?2?0?0?9? ?1?4?:?3?0? ? ? ? ? >> > >> in unicode as well as western page encoding. Even other pages are coming > >> all messed up even with all scripts allowed in ff3. > >> > >> how to read it correctly? [...] The encoding for the page seems to be UTF-16. My Firefox auto-detects this. Try setting your encoding manually, or turn auto-detect on. > Nope. that is converting one way from Kruti Dev 010 to Unicode. > > I want the other way conversion, from Unicode to Kruti Dev 010 > > thanks. [...] Um, what do you see on the page? There are two text boxes, the top one for conversion from Krutidev-010 to Unicode, and the bottom one for the reverse conversion. Regards, Gora From arjunaraoc at googlemail.com Mon Apr 27 08:07:44 2009 From: arjunaraoc at googlemail.com (arjuna rao chavala) Date: Mon, 27 Apr 2009 20:37:44 +0530 Subject: [Padma] How to handle proprietary fonts encoded in utf-8 Message-ID: <8426eb510904270807s66e84c07k47ac3c90f4ea3b25@mail.gmail.com> Hi Gora, My comments prefixed with double greater than >> and embedded in your email. Thanks Arjun From: Gora Mohanty Subject: Re: [Padma] How to handle proprietary fonts encoded in utf-8 for ex: www.sakshi.com (Telugu newspaper) To: padma at mozdev.org Message-ID: <20090426140956.38602881 at mail.sarai.net> Content-Type: text/plain; charset=US-ASCII On Sun, 26 Apr 2009 13:53:41 +0530 arjuna rao chavala wrote: [...] > The one year old Sakshi Telugu news paper is using utf-8 with proprietary > code points. Like Eenadu, extraneous characters (double dagger in case of > Sakshi and Hyphen in case of Eenadu) appear, when the page is rendered with > proprietary font. [...] This does not make sense to me, because if the encoding claimed to be UTF-8, but was really some proprietary encoding, no browser (nor any rendering engine that did not have that proprietary encoding specifically built in) would be able to render the text properly. >> You can download the font that was utilized from the URL home page and see for yourself on Firefox. Is it possible instead that their UTF-8 is somehow messed up? Do you have links to UTF-8 encoded pages that render properly in one browser, but not in another? >> I have verified the hex data, it is a proper utf-8, once you extract, the code points (1 byte or 2 byte) point to the font glyphs (truetype). Regards, Gora From gmhossain at gmail.com Mon Apr 27 08:36:37 2009 From: gmhossain at gmail.com (Golam Mortuza Hossain) Date: Mon, 27 Apr 2009 12:36:37 -0300 Subject: [Padma] How to handle proprietary fonts encoded in utf-8 In-Reply-To: <8426eb510904270807s66e84c07k47ac3c90f4ea3b25@mail.gmail.com> References: <8426eb510904270807s66e84c07k47ac3c90f4ea3b25@mail.gmail.com> Message-ID: <141c154a0904270836j6a79738bk950edd9304482597@mail.gmail.com> Hi, On Mon, Apr 27, 2009 at 12:07 PM, arjuna rao chavala wrote: >> The one year old Sakshi ?Telugu news paper ?is using utf-8 with proprietary >> code points. ?Like Eenadu, extraneous characters (double dagger ?in case of >> Sakshi and Hyphen in case of Eenadu) appear, when the page is rendered > with >> proprietary font. > [...] > This does not make sense to me, because if the encoding > claimed to be UTF-8, but was really some proprietary > encoding, no browser (nor any rendering engine that did > not have that proprietary encoding specifically built in) > would be able to render the text properly. >>> You can download the font that was utilized from the URL home page and > see for yourself on Firefox. I have seen such situations for couple of Bengali sites as well. They use UTF-8 encoding but their character map is non-standard as they use code-points from other languages. (I wonder how could a company hire such developers who essentially screw their own site in Google search (Bengali texts appears to be mix of Bengali + Chinese character) but still get paid!) BTW, Padma can handle them without any issue. Just re-map those non-standard character to standard Unicode. For example: I used XXX.codepoint_E502 = "\uE502" ; XXX.toPadma[XXX.codepoint_E502] = Padma.vowelsn_AI ; Cheers, Golam From gora at sarai.net Mon Apr 27 10:49:36 2009 From: gora at sarai.net (Gora Mohanty) Date: Mon, 27 Apr 2009 23:19:36 +0530 Subject: [Padma] How to handle proprietary fonts encoded in utf-8 In-Reply-To: <141c154a0904270836j6a79738bk950edd9304482597@mail.gmail.com> References: <8426eb510904270807s66e84c07k47ac3c90f4ea3b25@mail.gmail.com> <141c154a0904270836j6a79738bk950edd9304482597@mail.gmail.com> Message-ID: <20090427231936.04cfd333@mail.sarai.net> On Mon, 27 Apr 2009 12:36:37 -0300 Golam Mortuza Hossain wrote: [...] > I have seen such situations for couple of Bengali sites as well. > They use UTF-8 encoding but their character map is non-standard > as they use code-points from other languages. Ah, thanks for the clarification. That makes more sense, and is actually OK, though probably not good practice. > (I wonder how could a company hire such developers who > essentially screw their own site in Google search (Bengali texts > appears to be mix of Bengali + Chinese character) but still get > paid!) [...] Agreed on the lack of standardisation by most so-called web developers. However, I don't see how one would get Chinese, as no Indian language characters should be mapped there in Unicode. Can you point me to any examples? Regards, Gora From gora at sarai.net Mon Apr 27 10:49:35 2009 From: gora at sarai.net (Gora Mohanty) Date: Mon, 27 Apr 2009 23:19:35 +0530 Subject: [Padma] How to handle proprietary fonts encoded in utf-8 In-Reply-To: <8426eb510904270807s66e84c07k47ac3c90f4ea3b25@mail.gmail.com> References: <8426eb510904270807s66e84c07k47ac3c90f4ea3b25@mail.gmail.com> Message-ID: <20090427231935.7249204b@mail.sarai.net> On Mon, 27 Apr 2009 20:37:44 +0530 arjuna rao chavala wrote: > Hi Gora, > > My comments prefixed with double greater than >> and embedded in your email. [...] > >> You can download the font that was utilized from the URL home page and > see for yourself on Firefox. Would it be possible for you to send a specific link? I cannot read Telugu, and it would be difficult for me to go looking for mis-renderings. [...] > >> I have verified the hex data, it is a proper utf-8, once you extract, the > code points (1 byte or 2 byte) point to the font glyphs (truetype). [...] Can you figure out what Unicode codepoint the UTF-8 corresponds to? An editor like yudit (http://yudit.org) should be able to tell you if you cut and paste UTF-8 into it. As someone else has suggested, is it possible that the page is using Unicode codepoints for some other script? Regards, Gora From gmhossain at gmail.com Mon Apr 27 11:18:14 2009 From: gmhossain at gmail.com (Golam Mortuza Hossain) Date: Mon, 27 Apr 2009 15:18:14 -0300 Subject: [Padma] How to handle proprietary fonts encoded in utf-8 In-Reply-To: <20090427231936.04cfd333@mail.sarai.net> References: <8426eb510904270807s66e84c07k47ac3c90f4ea3b25@mail.gmail.com> <141c154a0904270836j6a79738bk950edd9304482597@mail.gmail.com> <20090427231936.04cfd333@mail.sarai.net> Message-ID: <141c154a0904271118q5278869ajf8c72ecd30f5fa52@mail.gmail.com> Hi, On Mon, Apr 27, 2009 at 2:49 PM, Gora Mohanty wrote: > Agreed on the lack of standardisation by most so-called web > developers. However, I don't see how one would get Chinese, > as no Indian language characters should be mapped there in > Unicode. Can you point me to any examples? > Unfortunately, that precisely these sites do. Here is an example http://www.amardeshbd.com They basically assign every Bengali conjunct a separate codepoint which is in violation of Unicode standard. I may be wrong in saying whether those characters are exactly Chinese but from the appearance of the glyphs, it looks similar. Cheers, Golam From arjunaraoc at googlemail.com Tue Apr 28 10:23:57 2009 From: arjunaraoc at googlemail.com (arjuna rao chavala) Date: Tue, 28 Apr 2009 22:53:57 +0530 Subject: [Padma] How to handle proprietary fonts encoded in utf-8 In-Reply-To: <20090427231935.7249204b@mail.sarai.net> References: <8426eb510904270807s66e84c07k47ac3c90f4ea3b25@mail.gmail.com> <20090427231935.7249204b@mail.sarai.net> Message-ID: <8426eb510904281023v3b034020u3d062e70d216706c@mail.gmail.com> No need to be aware of language. Just have a glance at www.eenadu.net with Padma and www.sakshi.com with proprietary font loaded. You will see extraneous characters (double dagger with www.sakshi.com. On Mon, Apr 27, 2009 at 11:19 PM, Gora Mohanty wrote: > On Mon, 27 Apr 2009 20:37:44 +0530 > arjuna rao chavala wrote: > > > Hi Gora, > > > > My comments prefixed with double greater than >> and embedded in your > email. > [...] > > >> You can download the font that was utilized from the URL home page and > > see for yourself on Firefox. > > Would it be possible for you to send a specific link? I > cannot read Telugu, and it would be difficult for me to > go looking for mis-renderings. > > [...] > > >> I have verified the hex data, it is a proper utf-8, once you extract, > the > > code points (1 byte or 2 byte) point to the font glyphs (truetype). > [...] > > Can you figure out what Unicode codepoint the UTF-8 corresponds to? > An editor like yudit (http://yudit.org) should be able to tell you > if you cut and paste UTF-8 into it. As someone else has suggested, > is it possible that the page is using Unicode codepoints for some > other script? spent enough time deciphering utf-8 and mapping to font code points. utf-8 is used just as file format and no linkage with unicode is needed for the same. > > > Regards, > Gora > _______________________________________________ > Padma mailing list > Padma at mozdev.org > https://www.mozdev.org/mailman/listinfo/padma > From arjunaraoc at googlemail.com Tue Apr 28 10:36:56 2009 From: arjunaraoc at googlemail.com (arjuna rao chavala) Date: Tue, 28 Apr 2009 23:06:56 +0530 Subject: [Padma] How to handle proprietary fonts encoded in utf-8 In-Reply-To: <141c154a0904270836j6a79738bk950edd9304482597@mail.gmail.com> References: <8426eb510904270807s66e84c07k47ac3c90f4ea3b25@mail.gmail.com> <141c154a0904270836j6a79738bk950edd9304482597@mail.gmail.com> Message-ID: <8426eb510904281036yff443b9wc1907e0fe196eb6e@mail.gmail.com> When i make changes in an existing .js file for test purposes, the script does not seem to run with a test file. Even the automatic conversion that worked before for the usual sites no longer works. I am following the instructions as given in http://tidbits.co.in/content/how-write-padma-conversion-file Thanks Arjun On Mon, Apr 27, 2009 at 9:06 PM, Golam Mortuza Hossain wrote: > Hi, > > > On Mon, Apr 27, 2009 at 12:07 PM, arjuna rao chavala > wrote: > >> The one year old Sakshi Telugu news paper is using utf-8 with > proprietary > >> code points. Like Eenadu, extraneous characters (double dagger in case > of > >> Sakshi and Hyphen in case of Eenadu) appear, when the page is rendered > > with > >> proprietary font. > > [...] > > This does not make sense to me, because if the encoding > > claimed to be UTF-8, but was really some proprietary > > encoding, no browser (nor any rendering engine that did > > not have that proprietary encoding specifically built in) > > would be able to render the text properly. > >>> You can download the font that was utilized from the URL home page and > > see for yourself on Firefox. > > > I have seen such situations for couple of Bengali sites as well. > They use UTF-8 encoding but their character map is non-standard > as they use code-points from other languages. > (I wonder how could a company hire such developers who > essentially screw their own site in Google search (Bengali texts > appears to be mix of Bengali + Chinese character) but still get > paid!) > > BTW, Padma can handle them without any issue. Just re-map > those non-standard character to standard Unicode. > > For example: I used > > XXX.codepoint_E502 = "\uE502" ; > XXX.toPadma[XXX.codepoint_E502] = Padma.vowelsn_AI ; > > > Cheers, > Golam > _______________________________________________ > Padma mailing list > Padma at mozdev.org > https://www.mozdev.org/mailman/listinfo/padma > From harshita at atc.tcs.com Tue Apr 28 22:36:59 2009 From: harshita at atc.tcs.com (Harshita) Date: Wed, 29 Apr 2009 11:06:59 +0530 Subject: [Padma] How to handle proprietary fonts encoded in utf-8 In-Reply-To: <8426eb510904281036yff443b9wc1907e0fe196eb6e@mail.gmail.com> References: <8426eb510904270807s66e84c07k47ac3c90f4ea3b25@mail.gmail.com> <141c154a0904270836j6a79738bk950edd9304482597@mail.gmail.com> <8426eb510904281036yff443b9wc1907e0fe196eb6e@mail.gmail.com> Message-ID: <1240983419.23874.12.camel@localhost.localdomain> Hi, Check the Error Console, it might help in debugging the problem. One more thing, saakshi's font SHREE-0908W is looking similar to shree-tel-0900 except for some changes. Modifying the mappings will speed up the development process I hope. regards, Harshita On Tue, 2009-04-28 at 23:06 +0530, arjuna rao chavala wrote: > When i make changes in an existing .js file for test purposes, the script > does not seem to run with a test file. Even the automatic conversion that > worked before for the usual sites no longer works. > > I am following the instructions as given in > http://tidbits.co.in/content/how-write-padma-conversion-file > > Thanks > Arjun > > On Mon, Apr 27, 2009 at 9:06 PM, Golam Mortuza Hossain > wrote: > > > Hi, > > > > > > On Mon, Apr 27, 2009 at 12:07 PM, arjuna rao chavala > > wrote: > > >> The one year old Sakshi Telugu news paper is using utf-8 with > > proprietary > > >> code points. Like Eenadu, extraneous characters (double dagger in case > > of > > >> Sakshi and Hyphen in case of Eenadu) appear, when the page is rendered > > > with > > >> proprietary font. > > > [...] > > > This does not make sense to me, because if the encoding > > > claimed to be UTF-8, but was really some proprietary > > > encoding, no browser (nor any rendering engine that did > > > not have that proprietary encoding specifically built in) > > > would be able to render the text properly. > > >>> You can download the font that was utilized from the URL home page and > > > see for yourself on Firefox. > > > > > > I have seen such situations for couple of Bengali sites as well. > > They use UTF-8 encoding but their character map is non-standard > > as they use code-points from other languages. > > (I wonder how could a company hire such developers who > > essentially screw their own site in Google search (Bengali texts > > appears to be mix of Bengali + Chinese character) but still get > > paid!) > > > > BTW, Padma can handle them without any issue. Just re-map > > those non-standard character to standard Unicode. > > > > For example: I used > > > > XXX.codepoint_E502 = "\uE502" ; > > XXX.toPadma[XXX.codepoint_E502] = Padma.vowelsn_AI ; > > > > > > Cheers, > > Golam > > _______________________________________________ > > Padma mailing list > > Padma at mozdev.org > > https://www.mozdev.org/mailman/listinfo/padma > > > _______________________________________________ > Padma mailing list > Padma at mozdev.org > https://www.mozdev.org/mailman/listinfo/padma From sourabindu at gmail.com Wed Apr 29 00:37:20 2009 From: sourabindu at gmail.com (Sourabindu Dutta) Date: Wed, 29 Apr 2009 13:07:20 +0530 Subject: [Padma] How to downlaod an converted html pages with padma add on from terminal Message-ID: Hi, Is there any way to downlaod a converted html page with indic font transformed to indic font? I think wget will not serve the purpose in linux. Regards, Sourabindu From saravanannkl at gmail.com Wed Apr 29 16:47:46 2009 From: saravanannkl at gmail.com (Saravana) Date: Wed, 29 Apr 2009 19:47:46 -0400 Subject: [Padma] How to downlaod an converted html pages with padma add on from terminal In-Reply-To: References: Message-ID: You can try XULRunner (https://developer.mozilla.org/En/XULRunner) with padma. But you need to add XULRunner support for padma, which might be difficult, if you are new to Mozilla platform. Thanks Saravana On Wed, Apr 29, 2009 at 3:37 AM, Sourabindu Dutta wrote: > Hi, > Is there any way to downlaod a converted html page with indic font > transformed to indic font? I think wget will not serve the purpose in linux. > Regards, > Sourabindu From arjunaraoc at googlemail.com Wed Apr 29 09:51:29 2009 From: arjunaraoc at googlemail.com (arjuna rao chavala) Date: Wed, 29 Apr 2009 22:21:29 +0530 Subject: [Padma] How to handle proprietary fonts encoded in utf-8 In-Reply-To: <1240983419.23874.12.camel@localhost.localdomain> References: <8426eb510904270807s66e84c07k47ac3c90f4ea3b25@mail.gmail.com> <141c154a0904270836j6a79738bk950edd9304482597@mail.gmail.com> <8426eb510904281036yff443b9wc1907e0fe196eb6e@mail.gmail.com> <1240983419.23874.12.camel@localhost.localdomain> Message-ID: <8426eb510904290951g41399a76ne4472d21917ab083@mail.gmail.com> I have done the mapping. I am not able to understand error message. I enclose .js file with sakshi font mapping changes to be copied as ShreeTel0900.js and a sample sakhi html file with font html code set to Shree-tel-0900 Thanks Arjun On Wed, Apr 29, 2009 at 11:06 AM, Harshita wrote: > Hi, > > Check the Error Console, it might help in debugging the problem. > > One more thing, saakshi's font SHREE-0908W is looking similar to > shree-tel-0900 except for some changes. Modifying the > mappings will speed up the development process I hope. > > regards, > Harshita > > On Tue, 2009-04-28 at 23:06 +0530, arjuna rao chavala wrote: > > When i make changes in an existing .js file for test purposes, the > script > > does not seem to run with a test file. Even the automatic conversion that > > worked before for the usual sites no longer works. > > > > I am following the instructions as given in > > http://tidbits.co.in/content/how-write-padma-conversion-file > > > > Thanks > > Arjun > > > > On Mon, Apr 27, 2009 at 9:06 PM, Golam Mortuza Hossain > > wrote: > > > > > Hi, > > > > > > > > > On Mon, Apr 27, 2009 at 12:07 PM, arjuna rao chavala > > > wrote: > > > >> The one year old Sakshi Telugu news paper is using utf-8 with > > > proprietary > > > >> code points. Like Eenadu, extraneous characters (double dagger in > case > > > of > > > >> Sakshi and Hyphen in case of Eenadu) appear, when the page is > rendered > > > > with > > > >> proprietary font. > > > > [...] > > > > This does not make sense to me, because if the encoding > > > > claimed to be UTF-8, but was really some proprietary > > > > encoding, no browser (nor any rendering engine that did > > > > not have that proprietary encoding specifically built in) > > > > would be able to render the text properly. > > > >>> You can download the font that was utilized from the URL home page > and > > > > see for yourself on Firefox. > > > > > > > > > I have seen such situations for couple of Bengali sites as well. > > > They use UTF-8 encoding but their character map is non-standard > > > as they use code-points from other languages. > > > (I wonder how could a company hire such developers who > > > essentially screw their own site in Google search (Bengali texts > > > appears to be mix of Bengali + Chinese character) but still get > > > paid!) > > > > > > BTW, Padma can handle them without any issue. Just re-map > > > those non-standard character to standard Unicode. > > > > > > For example: I used > > > > > > XXX.codepoint_E502 = "\uE502" ; > > > XXX.toPadma[XXX.codepoint_E502] = Padma.vowelsn_AI ; > > > > > > > > > Cheers, > > > Golam > > > _______________________________________________ > > > Padma mailing list > > > Padma at mozdev.org > > > https://www.mozdev.org/mailman/listinfo/padma > > > > > _______________________________________________ > > Padma mailing list > > Padma at mozdev.org > > https://www.mozdev.org/mailman/listinfo/padma > > _______________________________________________ > Padma mailing list > Padma at mozdev.org > https://www.mozdev.org/mailman/listinfo/padma >