Jump to content
php.lv forumi
Sign in to follow this  
daGrevis

Porter stemmer

Recommended Posts

Ir tāda ļoti noderīga lieta kā "Porter stemmer". Kas tas ir? Citēju...

 

The Porter stemming algorithm (or ‘Porter stemmer’) is a process for removing the commoner morphological and inflexional endings from words in English. Its main use is as part of a term normalisation process that is usually done when setting up Information Retrieval systems.

 

Citiem vārdiem sakot, tas ir skripts, kas vārdu, piemēram, "apples" pārveidos kā "apple". Šis algoritms ir noderīgs, piemēram, veidojot meklētājus lapai. Šeit algoritms ir pielāgots PHP valodai!

 

Problēma... šis algoritms ir angļu valodai. Latviešu valodā gramatikas likumi ir daudz savādāki. Es teiktu, pat krāšņāki! Vai ir kāds guru, kas spētu to pielāgot arī latviešu valodai? Varbūt kas tāds jau ir gatavs! Kāds padalīsies?

Share this post


Link to post
Share on other sites

esam izmantojuši hunspell, lai no lv valodas dabūtu ārā vārdus nominatīvā. varbūt noder.

Share this post


Link to post
Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

Loading...
Sign in to follow this  

×
×
  • Create New...