Hello friends, I want to found some algorithms for text processing.
I have a lots of entries in the database and now I want to split by category (news, history, sport, business etc...) but I don't know none algorithm(s) for text processing.

So, my question is, what is the most popular algorithms for text processing (split by categories, find most similar items etc...) ?

Thanks.

Hello friends, thanks for your suggestion, but my problem is solved using the library collective.classification. :)

Be a part of the DaniWeb community

We're a friendly, industry-focused community of developers, IT pros, digital marketers, and technology enthusiasts meeting, networking, learning, and sharing knowledge.