Skip to content

Research

Page 16 of 21

Superstudent intelligence in thermodynamics

research note

Superstudent intelligence in thermodynamics

·7 min read·Rebecca Loubet, Pascal Zittlau, Marco Hoffmann et al.

This study reports a landmark evaluation of OpenAI's latest large language model, o3, on a challenging thermodynamics exam typically taken by university engineering students

researchlarge-language-modelthermodynamicsacademic-exam-evaluationzero-shot-learning

Read note → Source paper ↗

LeanTutor: Towards a Verified AI Mathematical Proof Tutor

research note

LeanTutor: Towards a Verified AI Mathematical Proof Tutor

·8 min read·Manooshree Patel, Rayna Bhattacharyya, Thomas Lu et al.

This paper presents LeanTutor, a proof-of-concept AI-based mathematical proof tutoring system that combines the language fluency of Large Language Models (LLMs) with the formal correctness guarante…

researchai-math-tutoringautoformalizationtheorem-provingnatural-language-feedback

Read note → Source paper ↗

Risks & Benefits of LLMs & GenAI for Platform Integrity, Healthcare Diagnostics, Financial Trust and Compliance, Cybersecurity, Privacy & AI Safety: A Comprehensive Survey, Roadmap & Implementation Blueprint

research note

Risks & Benefits of LLMs & GenAI for Platform Integrity, Healthcare Diagnostics, Financial Trust and Compliance, Cybersecurity, Privacy & AI Safety: A Comprehensive Survey, Roadmap & Implementation Blueprint

·11 min read·Kiarash Ahi

This paper is a broad survey and implementation blueprint about the dual-use impact of LLMs and GenAI on platform integrity, cybersecurity, privacy, financial compliance, and healthcare diagnostics

researchsurveyplatform-integrityllm-defensetrust-and-safety

Read note → Source paper ↗

An open-source Modular Online Psychophysics Platform (MOPP)

research note

An open-source Modular Online Psychophysics Platform (MOPP)

·11 min read·Yuval Samoilov-Kats, Matan Noach, Noam Beer et al.

MOPP is presented as an open-source, modular web platform for running online psychophysics experiments without requiring researchers to stitch together separate tools for task creation, hosting, au…

researchonline-experimentspsychophysicsresearch-platformbot-detection

Read note → Source paper ↗

EarthOL: A Proof-of-Human-Contribution Consensus Protocol -- Addressing Fundamental Challenges in Decentralized Value Assessment with Enhanced Verification and Security Mechanisms

research note

EarthOL: A Proof-of-Human-Contribution Consensus Protocol -- Addressing Fundamental Challenges in Decentralized Value Assessment with Enhanced Verification and Security Mechanisms

·8 min read·Jiaxiong He

EarthOL proposes a domain-restricted alternative to proof-of-work: instead of burning energy on arbitrary computation, the protocol tries to reward verifiable human contributions in bounded domains…

researchconsensus-protocolproof-of-humanfraud-detectionsybil-resistance

Read note → Source paper ↗

Articles are CC BY 4.0 — feel free to quote with attribution