Research · Page 59 | CaptchaLa Blog

research note

Reinforcement Learning for Exponential Utility: Algorithms and Convergence in Discounted MDPs

May 8, 2026·10 min read·Gugan Thoppe, L. A. Prashanth, Ankur Naskar et al.

This paper addresses a foundational gap in risk-sensitive reinforcement learning: the absence of principled, model-free, value-based (Q-learning-style) algorithms for optimizing exponential utility…

researchrisk-sensitive-rlexponential-utilityq-learningstochastic-approximation

Read note → Source paper ↗

research note

SCOPE: Structured Decomposition and Conditional Skill Orchestration for Complex Image Generation

May 8, 2026·14 min read·Tianfei Ren, Zhipeng Yan, Yiming Zhao et al.

SCOPE addresses a core failure mode in complex text-to-image generation that the authors term the 'Conceptual Rift': even when multi-step systems retrieve information, verify outputs, and attempt r…

researchtext-to-image-generationagentic-pipelinesstructured-verificationmulti-constraint-evaluation

Read note → Source paper ↗

research note

Securing Computer-Use Agents: A Unified Architecture-Lifecycle Framework for Deployment-Grounded Reliability

May 8, 2026·12 min read·Zejian Chen, Zhanyuan Liu, Chaozhuo Li et al.

This paper addresses a structural gap in the computer-use agent (CUA) literature: existing surveys organize the field by methods, platforms, benchmarks, or threat categories, but none provide a uni…

researchcomputer-use-agentsagent-securityruntime-oversightlifecycle-analysis

Read note → Source paper ↗

research note

Semi-supervised Method for Risk Prediction with Doubly Censored EHR Data

May 8, 2026·12 min read·Jie Zhou, Enhao Wang, Xuan Wang

This paper addresses a methodological gap in survival analysis for electronic health record (EHR) data: how to estimate risk effects (covariate coefficients in a semiparametric transformation model…

researchsemi-supervised-learningsurvival-analysiselectronic-health-recordsdouble-censoring

Read note → Source paper ↗

research note

TCMIIES: A Browser-Based LLM-Powered Intelligent Information Extraction System for Academic Literature

May 8, 2026·13 min read·Hanqing Zhao

TCMIIES (Traditional Chinese Medicine Information Intelligent Extraction System) addresses the practical barrier between LLM-powered information extraction and domain researchers who lack programmi…

researchinformation-extractionllm-promptingbrowser-based-nlpschema-guided-prompting

Read note → Source paper ↗

research note

Towards Highly-Constrained Human Motion Generation with Retrieval-Guided Diffusion Noise Optimization

May 8, 2026·13 min read·Hanchao Liu, Fang-Lue Zhang, Shining Zhang et al.

This paper addresses a fundamental failure mode of existing training-free diffusion noise optimization (DNO) methods for human motion generation: they break down when constraints become highly chal…

researchhuman-motion-generationdiffusion-noise-optimizationretrieval-augmented-generationtraining-free-control

Read note → Source paper ↗

research note

VecCISC: Improving Confidence-Informed Self-Consistency with Reasoning Trace Clustering and Candidate Answer Selection

May 8, 2026·14 min read·James Petullo, Sonny George, Dylan Cashman et al.

This paper addresses a real cost problem with 'think twice' inference-time scaling: methods like Confidence-Informed Self-Consistency (CISC) improve accuracy over vanilla Self-Consistency by having…

researchinference-time-scalingself-consistencyllm-efficiencyembedding-clustering

Read note → Source paper ↗

research note

WebTrap: Stealthy Mid-Task Hijacking of Browser Agents During Navigation

May 8, 2026·8 min read·Zhichao Liu, Wenbo Pan, Haining Yu et al.

This paper addresses security vulnerabilities in browser agents tasked with performing long-horizon navigation and interaction workflows

researchprompt-injectionbrowser-agentlong-horizon-navigationsecurity-vulnerability

Read note → Source paper ↗

research note

When the Ruler is Broken: Parsing-Induced Suppression in LLM-Based Security Log Evaluation

May 8, 2026·7 min read·Chaitanya Vilas Garware, Sharif Noor Zisad

This paper identifies and quantifies a critical evaluation failure mode in large language model (LLM)-based Security Operations Center (SOC) log classification systems, termed parsing-induced suppr…

researchsoc-llm-evaluationparsing-induced-suppressionsecurity-log-classificationlora-fine-tuning

Read note → Source paper ↗

research note

Zero-Shot Imagined Speech Decoding via Imagined-to-Listened MEG Mapping

May 8, 2026·13 min read·Maryam Maghsoudi, Shihab Shamma

This paper tackles one of the hardest problems in non-invasive brain-computer interfaces: decoding imagined speech from MEG without requiring labeled imagined-speech training data

researchbrain-computer-interfacemeg-decodingimagined-speechcontrastive-learning

Read note → Source paper ↗

research note

A statistical look on kinematic planes of satellite galaxies II: The physics behind their early formation in TNG50 MW/M31-like galaxies

May 7, 2026·12 min read·Matías Gámez-Marín, Rosa Domínguez-Tenreiro, Isabel Santos-Santos et al.

This paper (Paper VI in a series) investigates why kinematically persistent planes (KPPs) of satellite galaxies form around Milky Way / M31-like hosts in the TNG50 cosmological simulation

researchcosmological-simulationsatellite-galaxieslarge-scale-structuren-body-simulation

Read note → Source paper ↗

research note

Absolute continuity of generalized Wasserstein barycenters of finitely many measures

May 7, 2026·11 min read·Jianyu Ma

This paper addresses the absolute continuity of generalized Wasserstein barycenters on complete Riemannian manifolds when the transport cost takes the form c(x,y) = h(d_g(x,y)) for a strictly conve…

researchoptimal-transportwasserstein-barycenterriemannian-geometrymeasure-theory

Read note → Source paper ↗