Great Circle Associates

XCIN Mail-list
(January 2001)


Indexed By Date: [Previous] [Next] Indexed By Thread: [Previous] [Next]

Subject: bims.c比較好點的patch版..
From: vgod <vgod@circ.tcfsh.tc.edu.tw>
Date: Mon, 8 Jan 2001 14:21:49 +0800 (CST)
To: xcin@linux.org.tw
Delivered-To: xcin-gate@tlug.sinica.edu.tw
Delivered-To: xcin-list@tlug.sinica.edu.tw
Reply-To: xcin@tlug.sinica.edu.tw

嗯 小弟我剛才又改了一下bims.c
這次改的比較完善了 也能避開大多數的破音字同詞頻的問題
我本來的改法只是改成每次都傳回字序中的第一個字
但這樣的話就喪失了原有詞頻的好處
所以現在改成如果選出來詞頻最高的字是破音字(有兩個以上的音)
那就送出原字序的第一個
如果不是破音字的話就還是送出詞頻最高的字
剛才測試了一下,像"快","或"這些都沒問題
一般的字也都能選出出現頻率較高的
效果還不錯,請居士大大看看..^^
應該可以作為libtabe沒有完全改進前的緩衝用途

以下是patch

333c333,334
< 
---
>   struct ZhiInfo zhi;
>   
340d340
< 
355d354
< 
357c356,361
<   strncpy((char *)z, (char *)str+i*2, 2);
---
>   zhi.code=code;
>   tabeZhiInfoLookupYin(&zhi);
>   if(zhi.yin[1]!=0) /* zhi has more than one yin */
>      strncpy((char *)z, (char *)str,2);
>   else
>      strncpy((char *)z, (char *)str+i*2,2);

END_OF_PATCH


To Unsubscribe: send mail to majordomo@linux.org.tw
with "unsubscribe xcin" in the body of the message


Indexed By Date Previous: Re: tsi.src change of this week (2)
From: "Tzu-hsien Yu" <thyu@ck.tp.edu.tw>
Next: utf-8 support
From: Brian Spilsbury <brian@designix.com.au>
Indexed By Thread Previous: Re: tsi.src change of this week (2)
From: william.bbs@openbazaar.net (何陋居主)
Next: Re: utf-8 support
From: thhsieh@tlug.sinica.edu.tw