Morphological Rules Pertaining to Persian Spell Checking

Preface
«Analyzing Persian texts as some stemmer algorithms is essential for efficient spell checking because: It provides the level of consistency needed and It may work with a concise lexicon.
In this article the morphological rules pertaining to such algorithms are studied.»
 Introduction
In Persian words are extensively combined with various prefixes and suffixes, to make new words. In this sense, and if we define words digitally as strings of characters surrounded by space, the number of Persian words are enormously larger as compared to Latin languages as English. For example the word كتاب (ketab=book) generates following derivatives:

ketab_ha books
ketab_am my book, I am a book 
ketab_at your book 
ketab_ash his book 
ketab_eman our book 
ketab_eshan their book 
ketab_i a book, you are a book, related to books
ketab_im we are books 
ketab_id you are books 
ketab_and they are books 
ketab_itar more related to books 
ketab_itarin most related to books 
ketab_hayam my books 
ketab_hayat your books 
ketab_hayash his/her books
ketab_hayeman our books
ketab_hayetan your books
ketab_hayeshan their books
ketab_haei some books
   
*the suffixes are presented just as they spelled in Persian.

 As seen in this example 19 different words can be made by the simple root «ketab».

 The term Morphological Rules then refers to such rules in Persian that specify how new words can be made. It should be noted here that, by making words, we do not mean the process of generating totally new words as it is usually meant in Persian literature. Actually no one talks about ‹ketab_ha› as a new word made from ‹ketab›. This is because our digital definition of word: «a string of letters separated by space»
Thus, here we are confined rather to those simple and certain rules that are thought to be useful in the process of digital proofing.

The term curtain is important because, we are not going to consider about those patterns that are rarely used. We consider those rules that can be applied almost in all cases. Nevertheless, the rules are applicable to words based on their grammatical natures. For example you cannot pluralize a pronoun, or only verbs can be conjugated.

Thus, it should be assumed that the Morphological Rules studied here are supported by some Lexicon in which Morphemes are stored with flags that designate their grammatical nature as pertaining to stated rules. The terms Flag, and Morpheme in this article refers to such Lexicon…

Find the remaining on the following link.

Download Complete Document

12 نظر برای “Morphological Rules Pertaining to Persian Spell Checking

  1. Having read this I thought it was really informative. I appreciate you spending
    some time and energy to put this information together.
    I once again find myself spending a lot of time both reading
    and posting comments. But so what, it was still worthwhile!

  2. Undeniably believe that which you said. Your favorite justification seemed to be on the internet the
    easiest thing to be aware of. I say to you, I definitely get irked while people consider worries that they just do not know about.
    You managed to hit the nail upon the top and also defined
    out the whole thing without having side-effects , people could take a signal.
    Will probably be back to get more. Thanks

  3. I just like the valuable information you provide for your articles.
    I will bookmark your blog and check once more here regularly.
    I am rather sure I’ll be told many new stuff right right here! Best of luck for the next!

  4. Fantastic site you have here but I was curious about if you
    knew of any discussion boards that cover the same topics talked about
    in this article? I’d really love to be a part of community where I can get comments from other knowledgeable individuals that share the same interest. If you have any suggestions, please let me know. Appreciate it!

  5. After looking over a number of the blog articles on your
    blog, I seriously appreciate your technique
    of blogging. I book marked it to my bookmark site list and will be checking back soon.
    Take a look at my website too and let me know your opinion.

  6. Good day! I could have sworn I’ve been to your blog before but after looking at a few of the articles I realized it’s new to me.
    Regardless, I’m certainly delighted I came across it and I’ll be book-marking it and checking back often!

  7. I just couldn’t leave your website before suggesting that I extremely enjoyed the usual information a person supply for your guests? Is gonna be again ceaselessly in order to check up on new posts

  8. I rarely leave a response, however I looked at a lot of remarks
    on Morphological Rules Pertaining to Persian Spell Checking |
    Babak Mahmoudi’s Blog. I do have a couple of questions for you if you do not mind. Is it only me or does it look like some of these remarks appear as if they are left by brain dead people? 😛 And, if you are posting at other online social sites, I’d like to follow you.
    Could you list of all of all your shared pages like your linkedin profile, Facebook page or twitter feed?

برای Denny پاسخی بگذارید لغو پاسخ