Preface
«Analyzing Persian texts as some stemmer algorithms is essential for efficient spell checking because: It provides the level of consistency needed and It may work with a concise lexicon.
In this article the morphological rules pertaining to such algorithms are studied.»
Introduction
In Persian words are extensively combined with various prefixes and suffixes, to make new words. In this sense, and if we define words digitally as strings of characters surrounded by space, the number of Persian words are enormously larger as compared to Latin languages as English. For example the word كتاب (ketab=book) generates following derivatives:
ketab_ha | books |
ketab_am | my book, I am a book |
ketab_at | your book |
ketab_ash | his book |
ketab_eman | our book |
ketab_eshan | their book |
ketab_i | a book, you are a book, related to books |
ketab_im | we are books |
ketab_id | you are books |
ketab_and | they are books |
ketab_itar | more related to books |
ketab_itarin | most related to books |
ketab_hayam | my books |
ketab_hayat | your books |
ketab_hayash | his/her books |
ketab_hayeman | our books |
ketab_hayetan | your books |
ketab_hayeshan | their books |
ketab_haei | some books |
*the suffixes are presented just as they spelled in Persian. |
As seen in this example 19 different words can be made by the simple root «ketab».
The term Morphological Rules then refers to such rules in Persian that specify how new words can be made. It should be noted here that, by making words, we do not mean the process of generating totally new words as it is usually meant in Persian literature. Actually no one talks about ‹ketab_ha› as a new word made from ‹ketab›. This is because our digital definition of word: «a string of letters separated by space»
Thus, here we are confined rather to those simple and certain rules that are thought to be useful in the process of digital proofing.
The term curtain is important because, we are not going to consider about those patterns that are rarely used. We consider those rules that can be applied almost in all cases. Nevertheless, the rules are applicable to words based on their grammatical natures. For example you cannot pluralize a pronoun, or only verbs can be conjugated.
Thus, it should be assumed that the Morphological Rules studied here are supported by some Lexicon in which Morphemes are stored with flags that designate their grammatical nature as pertaining to stated rules. The terms Flag, and Morpheme in this article refers to such Lexicon…
Find the remaining on the following link.
Having read this I thought it was really informative. I appreciate you spending
some time and energy to put this information together.
I once again find myself spending a lot of time both reading
and posting comments. But so what, it was still worthwhile!
This is my first time pay a quick visit at here and i am truly pleassant to
read everthing at alone place.
Undeniably believe that which you said. Your favorite justification seemed to be on the internet the
easiest thing to be aware of. I say to you, I definitely get irked while people consider worries that they just do not know about.
You managed to hit the nail upon the top and also defined
out the whole thing without having side-effects , people could take a signal.
Will probably be back to get more. Thanks
I’m not that much of a online reader to be honest but your blogs really nice,
keep it up! I’ll go ahead and bookmark your website to come back in the future. All the best
I just like the valuable information you provide for your articles.
I will bookmark your blog and check once more here regularly.
I am rather sure I’ll be told many new stuff right right here! Best of luck for the next!
Fantastic site you have here but I was curious about if you
knew of any discussion boards that cover the same topics talked about
in this article? I’d really love to be a part of community where I can get comments from other knowledgeable individuals that share the same interest. If you have any suggestions, please let me know. Appreciate it!
After looking over a number of the blog articles on your
blog, I seriously appreciate your technique
of blogging. I book marked it to my bookmark site list and will be checking back soon.
Take a look at my website too and let me know your opinion.
Good day! I could have sworn I’ve been to your blog before but after looking at a few of the articles I realized it’s new to me.
Regardless, I’m certainly delighted I came across it and I’ll be book-marking it and checking back often!
whoah this blog is excellent i really like studying your articles.
Keep up the great work! You recognize, a lot of persons are hunting
around for this info, you could help them greatly.
I just couldn’t leave your website before suggesting that I extremely enjoyed the usual information a person supply for your guests? Is gonna be again ceaselessly in order to check up on new posts
I rarely leave a response, however I looked at a lot of remarks
on Morphological Rules Pertaining to Persian Spell Checking |
Babak Mahmoudi’s Blog. I do have a couple of questions for you if you do not mind. Is it only me or does it look like some of these remarks appear as if they are left by brain dead people? 😛 And, if you are posting at other online social sites, I’d like to follow you.
Could you list of all of all your shared pages like your linkedin profile, Facebook page or twitter feed?
Collaborate with team-mates, partners, and customers with documents which are always current and available from very nearly anywhere.
microsoft office