Skip to content

Piero Bosio Social Web Site Personale Logo Fediverso

Social Forum federato con il resto del mondo. Non contano le istanze, contano le persone

A thought that popped into my head when I woke up at 4 am and couldn’t get back to sleep…

Uncategorized
66 35 317

Gli ultimi otto messaggi ricevuti dalla Federazione
Post suggeriti
  • 0 Votes
    1 Posts
    8 Views
    Language models cannot reliably distinguish belief from knowledge and factAbstract-----------«As language models (LMs) increasingly infiltrate into high-stakes domains such as law, medicine, journalism and science, their ability to distinguish belief from knowledge, and fact from fiction, becomes imperative. Failure to make such distinctions can mislead diagnoses, distort judicial judgments and amplify misinformation. Here we evaluate 24 cutting-edge LMs using a new KaBLE benchmark of 13,000 questions across 13 epistemic tasks. Our findings reveal crucial limitations. In particular, all models tested systematically fail to acknowledge first-person false beliefs, with GPT-4o dropping from 98.2% to 64.4% accuracy and DeepSeek R1 plummeting from over 90% to 14.4%. Further, models process third-person false beliefs with substantially higher accuracy (95% for newer models; 79% for older ones) than first-person false beliefs (62.6% for newer; 52.5% for older), revealing a troubling attribution bias. We also find that, while recent models show competence in recursive knowledge tasks, they still rely on inconsistent reasoning strategies, suggesting superficial pattern matching rather than robust epistemic understanding. Most models lack a robust understanding of the factive nature of knowledge, that knowledge inherently requires truth. These limitations necessitate urgent improvements before deploying LMs in high-stakes domains where epistemic distinctions are crucial.»#ai #LLMs #epistemology #knowledgehttps://www.nature.com/articles/s42256-025-01113-8
  • 0 Votes
    1 Posts
    7 Views
    A remarkably prophetic 1923 cartoon depicting how a creative process would be automated in 2023.#cartoon #tech #technology #BigTech #AI #ArtificialIntelligence #LLM #LLMs #MachineLearning #GenAI #generativeAI #AISlop #Meta #Google #gemini #OpenAI #ChatGPT #anthropic #claude
  • 0 Votes
    2 Posts
    10 Views
    Also, there is an enjoyment, a compulsion, a jouissance, in the act of speaking that does nothing to help prompt any consideration of the chain of signifiers we use. This is great for psychoanalysis. Great for in-group social bonding, but not so good for communication that can bridge gaps.Writing, which is usually slower, which comes with many more rules than speaking - spelling, grammar, etc. essentially slows down the process and gives opportunity for consideration of impact and resonance 2/
  • 0 Votes
    3 Posts
    10 Views
    @cosothegreat si va bene lo aggiornerò nei prossimi giorni