Sarah Jamie Lewis on Nostr: The more I think about search engines and compiling and weighting corpora, the more ...
The more I think about search engines and compiling and weighting corpora, the more inclined I am to implement hard signal-filters i.e. assume all documents are spam to start with and only accept a document into the corpus if it can be shown to be unspam-like.
Published at
2024-05-27 20:18:38Event JSON
{
"id": "a3fbeb233b6278d5c3ea092d97293744eb4b009c3c4d355b4dd468c63e7ec947",
"pubkey": "aed322bb94e499cd0cd23bb4a6b2cc04bdb2031ab09fc08a36e3a2e355a22b11",
"created_at": 1716841118,
"kind": 1,
"tags": [
[
"proxy",
"https://mastodon.social/users/sarahjamielewis/statuses/112514899523343556",
"activitypub"
]
],
"content": "The more I think about search engines and compiling and weighting corpora, the more inclined I am to implement hard signal-filters i.e. assume all documents are spam to start with and only accept a document into the corpus if it can be shown to be unspam-like.",
"sig": "33a9ab87d4e3038dced228d84fa96b3052ab9779bff6dd8c0a3e312ad7b90092e041cba131a7fa5d52bf8896b48006ada5f573c5013ca609f611b1f513bf9696"
}