Welcome to the Roman Gelembjuk's blog
Categorization of financial news by industry sector

Categorization (classification) of documents is very interesting Text Mining task.

Recently i had tried to create a tool that will categorize business news with industry . There are couple of different industry categories sets used in financial institutions.

I have used the set from Yahoo Finance

Last Updated on Thursday, 14 October 2010 19:51
How to extract key phrases from a text

Extracting keyphrases from a text is more complex task then extracting keywords from a text. But, usually, keyphrases are more useful in text mining tasks then just keywords.

I have created Perl script that extracts keyphrases from the text.


Last Updated on Monday, 09 August 2010 16:21
Extracting useful content from Web-resource. Technologies review.

There is review of techniques of extracting important content from web-resources.

It is not secret that Internet giants like Google, Microsoft or Yahoo already have powerful technologies that can extract important content from web-pages. But big corporations are not interested in publishing of their technologies.

If to try to find information on this question we can find 2 types of resources - scientific publications and blog posts and articles of individuals that did some small tools for their needs.

Let see what information we can find now for the subject "Extracting useful (important) information from web pages".

Last Updated on Wednesday, 21 April 2010 14:22
Most quick way to access different file clouds on Linux

I have tested a lot of tool and services to store files online. I chosen SMEStorage.com and use it now.

I found then most problem of online storage services is speed of accessing file.

If you need to get some file from your cloud you must to:

  • start browser
  • open site (ex. smestorage.com)
  • login to site
  • open page with files
  • sometimes you must to browse folders and this is new page/data loading for each folder

What to do if you have to get 1 file very quick and with bad Internet connection?

Last Updated on Thursday, 15 April 2010 13:35
SMEStorage Joomla component important fix

There was Local File Inclusion Vulnerability in SMEStorage Joomla component.

Please download updated version 1.1 and reinstall if you use version 1.0 .

Many thanks to  LatinHackTeam  for found this and letting know.

<< Start < Prev 1 2 3 4 5 6 Next > End >>

Page 3 of 6