2005-06-06
More Work on the Search Engine
If you've tried my search engine, you've probably noticed that it's been down almost as much as it's been up. That's because I've been doing a lot of work on it, adding more functionality, better relevance algorithms, etc. It now uses a pagerank-style algorithm that gives documents greater importance based on two things: the number of incoming links, and the "points" awarded by pages that link to it. Pages that have more incoming links award more points to the pages they link to. These general importance factors, along with various methods to determine how well the query matches the document, are calculated with some manually optimized weight values, and the resulting "relevance number" is given to the result. Finally, results are ordered, sliced up to pages, and displayed.
Oh yeah, I still have to add paging. Oh well, I have a bad habit of putting off the easy stuff. For now, you can just add "start" and "num" query string parameters to get your page.
0 comments
Comment moderation policy: Your comment will be reviewed before it is added to the site. This is in response to spam and other forms of abuse. I gladly accept comments containing criticism as long as the language is clean.
This weblog is powered by Blogger.