Piero Bosio Social Web Site Personale

A social forum federated with the rest of the world. Instances don't count, people do.

lol, "if only someone had warned us about this sort of thing?!"

Uncategorized

The last eight messages received from the Federation
Suggested posts
  • 0 Votes
    1 Posts
    8 Views
    Language models cannot reliably distinguish belief from knowledge and fact
    Abstract:
    «As language models (LMs) increasingly infiltrate into high-stakes domains such as law, medicine, journalism and science, their ability to distinguish belief from knowledge, and fact from fiction, becomes imperative. Failure to make such distinctions can mislead diagnoses, distort judicial judgments and amplify misinformation. Here we evaluate 24 cutting-edge LMs using a new KaBLE benchmark of 13,000 questions across 13 epistemic tasks. Our findings reveal crucial limitations. In particular, all models tested systematically fail to acknowledge first-person false beliefs, with GPT-4o dropping from 98.2% to 64.4% accuracy and DeepSeek R1 plummeting from over 90% to 14.4%. Further, models process third-person false beliefs with substantially higher accuracy (95% for newer models; 79% for older ones) than first-person false beliefs (62.6% for newer; 52.5% for older), revealing a troubling attribution bias. We also find that, while recent models show competence in recursive knowledge tasks, they still rely on inconsistent reasoning strategies, suggesting superficial pattern matching rather than robust epistemic understanding. Most models lack a robust understanding of the factive nature of knowledge, that knowledge inherently requires truth. These limitations necessitate urgent improvements before deploying LMs in high-stakes domains where epistemic distinctions are crucial.»
    #ai #LLMs #epistemology #knowledge
    https://www.nature.com/articles/s42256-025-01113-8
  • 0 Votes
    5 Posts
    10 Views
    @paraplegic_racehorse Excellent point! The article explicitly acknowledges this: AI doesn't 'choose' Markdown intentionally—the pattern emerges from training data (GitHub, Stack Overflow, technical docs). Org-mode is indeed equally human-readable, but its smaller footprint in training corpora explains the difference. This actually reinforces the broader point: format adoption creates network effects that AI amplifies.
  • 0 Votes
    1 Posts
    13 Views
    Grandiose day! I haven't read the book yet, though. I spent my evening building an automation by forking the Tinker hack Blusky integration for running playlists and so on. I am halfway there, but when I tested it the last block didn't seem to work, so I've contacted them about it. I hope to get help soon. This year I did a lot of techie projects for myself: after taking a course on AI I got into vibe coding, built a few tools for myself and then a snake game. Later I took an HTML/CSS course to understand the basics, loved it all, and started jamming more with Neocities. This year I also got excited about the Nothing community project challenge and built a marketing plan even though the deadline had already passed, since I found out about it late. I was so inspired by the brand that I went ahead and did something cool anyway. Nothing has cool software and hardware. The UI is cool, as they say. I got back into the swing of smartphones when I grabbed the Vivo iQOO Z10X as my daily driver, as they call it! Since then I've loved watching tech videos a lot. I do love tech. I am not diving much into fitness gadgets yet, as I find that would just send me tumbling down a path of anxiety.
    Well, the founder of Mastodon is passing the baton to the new guy, Felix! Eugen did a wonderful job with Mastodon. I saw his interview on YouTube while having dinner. I also read about a 101-year-old barista who has been serving since WWII; that was the good news over on bsky this morning. An 87-year-old lady also did something cool, which I don't recall. You know, we don't care much to remember good news, while we keep bad news etched in our brains the whole time. Bad news is impactful, isn't it? It creates fear, and then it becomes memorable all of a sudden. Ah!
  • 0 Votes
    1 Posts
    11 Views
    In 2024, TU Wien presented the world's first #nuclear clock. Now it has been demonstrated that the #technology can also be used to investigate unresolved questions in fundamental physics. #Physics #sflorg
    https://www.sflorg.com/2025/10/phy10272501.html