Posts

Showing posts from December, 2011

About Us Page Finder in Python

Few months ago, I wrote a Python code that tries to find the about us page given a URL. You can test it here: http://208.43.85.103/cgi-bin/aboutus/main.py I have used some heuristics from my observation to write the code. But as I have started to learn machine learning ( http://www.ml-class.org ), I am thinking to apply machine learning technique and rewrite this. I shall try to do it later, may be last week of December. Before doing this, I would like to improve my current program by incorporating more logic (but no ml technique). I shall publish the current algorithm soon here, but first it would be great if you can test it and give me some feedback and suggestions.