I tried making my own search engine based on Nutch (sometime around 2000).
We created a directory of sites with a strict selection, because I had experience administering Dmoz, and the sites that were added to our catalog were indexed. Not deep, just 1 click from the main page.
But then we did not have enough capacity to carry out a good search.
Now I’m thinking about getting back to it, as part of a project that I am writing now (MIT license; directory example: https://libarea.ru/web). We are experimenting and learning, working on facets for navigation. We will bolt on the search later; there is still a lot of work.
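For anyone wondering what "not deep, just 1 click from the main page" could look like in practice, here is a rough Python sketch. It is not how Nutch does it and not the code we had back then, just an illustration using only the standard library. The names `LinkExtractor`, `shallow_crawl` and the `fetch` callable are made up for this example; `fetch` is assumed to take a URL and return the HTML as a string, or `None` on failure.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkExtractor(HTMLParser):
    """Collects href values from <a> tags (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def shallow_crawl(home_url, fetch):
    """Return {url: html} for the home page and same-site pages one click away.

    `fetch` is an assumed callable: url -> html string, or None on failure.
    """
    pages = {}
    home_html = fetch(home_url)
    if home_html is None:
        return pages
    pages[home_url] = home_html

    parser = LinkExtractor()
    parser.feed(home_html)
    home_host = urlparse(home_url).netloc

    for href in parser.links:
        # Resolve relative links and drop the #fragment part.
        url = urljoin(home_url, href).split("#")[0]
        # Skip other sites and pages we already have.
        if urlparse(url).netloc != home_host or url in pages:
            continue
        html = fetch(url)
        if html is not None:
            pages[url] = html
    # Links found on the depth-1 pages are deliberately not followed.
    return pages
```

Passing the fetcher in as a parameter keeps the sketch testable without network access. A real crawler would also need robots.txt handling, rate limiting and timeouts, which are left out here.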
I think for my next project I’m gonna be looking into setting up topic-focused search engines, particularly for substance use and harm reduction.
https://www.cs.toronto.edu/~muuo/blog/build-yourself-a-mini-search-engine/
Doesn’t seem too hard. Has anyone ever done something like this? Curious about overhead costs and the like.