From a84534956250badc05e9b190f1559309591c4f15 Mon Sep 17 00:00:00 2001
From: Santhosh Thottingal
Date: Sat, 15 Aug 2009 14:21:12 +0530
Subject: The patterns for all languages made compatible with tex rules for hyphenation

Don't break on either side of zwj/zwnj for all languages
LEFTHYPHENMIN and RIGHTHYPHENMIN properties removed. It can be configured from applications
---
 hyphenation/hyph_or_IN.dic | 177 ++++++++++++++++++++++++++++-----------------
 1 file changed, 109 insertions(+), 68 deletions(-)
 (limited to 'hyphenation/hyph_or_IN.dic')

diff --git a/hyphenation/hyph_or_IN.dic b/hyphenation/hyph_or_IN.dic
index c865b7b..09e1fbf 100755
--- a/hyphenation/hyph_or_IN.dic
+++ b/hyphenation/hyph_or_IN.dic
@@ -16,71 +16,112 @@ UTF-8
 % License along with this library; if not, write to the Free Software
 % Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
 %
-LEFTHYPHENMIN 2
-RIGHTHYPHENMIN 2
-ଅ1
-ଆ1
-ଇ1
-ଈ1
-ଉ1
-ଊ1
-ଋ1
-ଏ1
-ଐ1
-ଔ1
-ା1
-ି1
-ୀ1
-ୁ1
-େ1
-ୋ1
-ୈ1
-ୌ1
-ୗ1
-୍2
-ଃ1
-ଂ1
-1ନ
-ନ୍2
-2ନ୍‍
-1ର
-ର୍2
-2ର୍‍
-1ଲ
-ଲ୍2
-2ଲ୍‍
-1ଳ
-ଳ୍2
-2ଳ୍‍
-1ଣ
-ଣ୍2
-2ଣ୍‍
-1କ
-1ଗ
-1ଖ
-1ଘ
-1ଙ
-1ଚ
-1ଛ
-1ଜ
-1ଝ
-1ଞ
-1ଟ
-1ଠ
-1ଡ
-1ଢ
-1ତ
-1ଥ
-1ଦ
-1ଧ
-1ପ
-1ଫ
-1ବ
-1ଭ
-1ମ
-1ଯ
-1ଵ
-1ଶ
-1ଷ
-1ସ
-1ହ
+% GENERAL RULE
+% Do not break either side of ZERO-WIDTH JOINER
+% (U+200D) and ZERO-WIDTH NON-JOINER (U+200C)
+2‍2
+2‌2
+% Break before or after any independent vowel.
+1ଅ1
+1ଆ1
+1ଇ1
+1ଈ1
+1ଉ1
+1ଊ1
+1ଋ1
+1ୠ1
+1ଌ1
+1ୡ1
+1ଏ1
+1ଐ1
+1ଓ1
+1ଔ1
+% Break after any dependent vowel, but not before.
+2ା1
+2ି1
+2ୀ1
+2ୁ1
+2ୂ1
+2ୃ1
+2େ1
+2ୈ1
+2ୋ1
+2ୌ1
+% Break before or after any consonant.
+1କ1
+1ଖ1
+1ଗ1
+1ଘ1
+1ଙ1
+1ଚ1
+1ଛ1
+1ଜ1
+1ଝ1
+1ଞ1
+1ଟ1
+1ଠ1
+1ଡ1
+1ଢ1
+1ଣ1
+1ତ1
+1ଥ1
+1ଦ1
+1ଧ1
+1ନ1
+1ପ1
+1ଫ1
+1ବ1
+1ଭ1
+1ମ1
+1ଯ1
+1ର1
+1ଲ1
+1ଳ1
+1ଵ1
+1ଶ1
+1ଷ1
+1ସ1
+1ହ1
+% Do not break before a final consonant or conjunct.
+2କ୍.
+2ଖ୍.
+2ଗ୍.
+2ଘ୍.
+2ଙ୍.
+2ଚ୍.
+2ଛ୍.
+2ଜ୍.
+2ଝ୍.
+2ଞ୍.
+2ଟ୍.
+2ଠ୍.
+2ଡ୍.
+2ଢ୍.
+2ଣ୍.
+2ତ୍.
+2ଥ୍.
+2ଦ୍.
+2ଧ୍.
+2ନ୍.
+2ପ୍.
+2ଫ୍.
+2ବ୍.
+2ଭ୍.
+2ମ୍.
+2ଯ୍.
+2ର୍.
+2୍.
+2ଲ୍.
+2ଳ୍.
+2୍.
+2ଵ୍.
+2ଶ୍.
+2ଷ୍.
+2ସ୍.
+2ହ୍.
+% Do not break before anusvara, visarga and length mark.
+2ଂ
+2ଃ
+2ୗ
+% Do not break either side of virama (may be within conjunct).
+2୍2
-- 
cgit
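
Note on reading the new patterns: the sketch below shows how TeX-style
(Liang) patterns of the kind added in this patch are commonly applied. It is
an illustration under stated assumptions, not the code of any particular
hyphenation library. The function names and the `left_min` / `right_min`
parameters are hypothetical; they stand in for the limits that, per the commit
message, applications now configure themselves, and they are counted here in
code points for simplicity. Only a small subset of the patterns is used.

```python
DIGITS = "0123456789"

# A few patterns copied from the patch above (subset, for illustration only).
PATTERNS = [
    "1କ1", "1ବ1", "1ତ1", "1ଧ1", "1ର1", "1ମ1", "1ଜ1", "1ଗ1",  # consonants
    "2ା1", "2ି1",                                            # dependent vowels
    "2୍2",                                                   # virama
    "2ତ୍.",                                                  # final consonant
]

def parse_pattern(pattern):
    """Split a pattern such as '2ା1' into its letters and inter-letter weights."""
    letters = []
    weights = [0]  # weights[i] applies to the gap before letters[i]
    for ch in pattern:
        if ch in DIGITS:
            weights[-1] = int(ch)
        else:
            letters.append(ch)
            weights.append(0)
    return "".join(letters), weights

def hyphenation_points(word, patterns, left_min=1, right_min=1):
    """Return indices k where a break is allowed between word[k-1] and word[k].

    left_min / right_min are hypothetical application-side limits,
    measured in code points.
    """
    table = dict(parse_pattern(p) for p in patterns)
    padded = "." + word + "."           # '.' marks the word boundaries
    scores = [0] * (len(padded) + 1)    # scores[i]: gap before padded[i]
    for start in range(len(padded)):
        for end in range(start + 1, len(padded) + 1):
            weights = table.get(padded[start:end])
            if weights:
                for offset, weight in enumerate(weights):
                    pos = start + offset
                    scores[pos] = max(scores[pos], weight)
    # Odd score: break allowed. Even score: break forbidden.
    return [k for k in range(left_min, len(word) - right_min + 1)
            if scores[k + 1] % 2 == 1]

def hyphenate(word, patterns, hyphen="-", **limits):
    """Join the pieces of `word` with `hyphen` at every allowed break."""
    pieces, prev = [], 0
    for point in hyphenation_points(word, patterns, **limits):
        pieces.append(word[prev:point])
        prev = point
    pieces.append(word[prev:])
    return hyphen.join(pieces)

if __name__ == "__main__":
    for sample in ("କବିତା", "ଧର୍ମ", "ଜଗତ୍"):
        print(hyphenate(sample, PATTERNS))
```

In this sketch the higher even weight from `2ି1` or `2୍2` overrides the odd
weight contributed by a neighbouring consonant pattern, which is how the
"break after, but not before" and "do not break either side of virama" rules
take effect. Passing larger `left_min` / `right_min` values suppresses breaks
near the word edges, the role the removed LEFTHYPHENMIN and RIGHTHYPHENMIN
lines used to play.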