#decemberAdventure day 9: back to #gopher

  • #decemberAdventure day 9: back to #gopher

    A number of years ago I set up my own search tool for the gopherspace. I've not redone a full crawl since getting it working, and due to some design issues, it eventually stopped working reliably enough to keep running. Due to RSI issues, I have prioritized other projects in the intervening years, but have slowly made a list of notes and plans for fixing it. Today's adventure is the first part of implementing my plan and getting it back to a usable state.

    I've rewritten the crawler. The new one is a lot less buggy than the original, and has a number of improvements, including a correctly working filter (supporting robots.txt and a defined list of servers not to index), better discovery of servers from the gopher maps, tracking of when servers were last scanned, request rate limiting, and facilities to avoid recording duplicate entries.
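
    To illustrate the server discovery, exclusion list, and duplicate handling described above, here is a rough sketch in Python. It is not the actual crawler: the names `Entry`, `parse_menu`, and `Index` are hypothetical, the menu format is assumed to be the usual tab-separated one (type and description, selector, host, port), and treating type `i` as a non-link informational line is an assumption based on common practice. robots.txt handling and rate limiting are left out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    item_type: str
    description: str
    selector: str
    host: str
    port: int


def parse_menu(text):
    """Parse tab-separated gopher menu lines into Entry records."""
    entries = []
    for line in text.splitlines():
        if line == ".":              # end-of-menu marker
            break
        fields = line.split("\t")
        if len(fields) < 4 or not fields[0]:
            continue                 # skip malformed lines
        try:
            port = int(fields[3])
        except ValueError:
            continue                 # skip lines with a non-numeric port
        entries.append(Entry(fields[0][0], fields[0][1:], fields[1],
                             fields[2].lower(), port))
    return entries


class Index:
    """Hypothetical in-memory store for servers and unique selectors."""

    def __init__(self, excluded=()):
        self.excluded = set(excluded)   # (host, port) pairs not to index
        self.servers = set()            # (host, port) pairs seen in menus
        self.selectors = {}             # (host, port, selector) -> descriptions

    def record(self, entry):
        """Store an entry; return True only if its selector is new."""
        if entry.item_type == "i":      # informational text, not a link
            return False
        server = (entry.host, entry.port)
        if server in self.excluded:
            return False
        self.servers.add(server)
        key = (entry.host, entry.port, entry.selector)
        is_new = key not in self.selectors
        # One selector may be linked under several descriptions.
        self.selectors.setdefault(key, set()).add(entry.description)
        return is_new


# Made-up menu: two links to the same selector, one excluded server.
menu = (
    "iWelcome\tfake\t(NULL)\t0\r\n"
    "1Docs\t/docs\texample.org\t70\r\n"
    "1Documentation\t/docs\texample.org\t70\r\n"
    "0Notes\t/notes.txt\tprivate.example\t70\r\n"
    ".\r\n"
)
index = Index(excluded={("private.example", 70)})
results = [index.record(entry) for entry in parse_menu(menu)]
```

    In this sketch a selector is stored once per server, while every distinct description that links to it is kept alongside, which is one way a crawl could end up with more descriptions than unique selectors.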

    My initial tests have been on my main gopher server (forthworks.com:70) and a number of my private ones. This totals 32k selectors across 3 servers. I'm going to start a broader scan of the public gopherspace soon, and will post an update once I get through the initial scan of a few servers.

    My full logs are at https://charles.childe.rs/DA2025

    Update on the initial scan: 91 servers (of 488 found in the indexes) scanned, 397 pending, 11 unreachable. 579,936 selectors with 2,193,285 descriptions. Data set is 569MiB in size.

    I'm stopping my scans for today and will resume them tomorrow.

    Update on my scan of gopherspace.

    725 servers identified
    84 unreachable
    6 restricted due to robots.txt or manual exclusion requests
    325 scanned completely
    1 in progress
    393 pending

    1,269,190 unique selectors, 5,699,901 descriptions.

    The scan will continue (slowly). I'm going to start writing the new front end for doing searches of the collected data next week.

