Search Results (1 to 10 of 2583 Results)


Benchmarking the Confidence of Large Language Models in Answering Clinical Questions: Cross-Sectional Evaluation Study

Our study corroborates GPT-4’s strong performance, particularly in psychiatry, where GPT-4o achieved 84.4% accuracy. However, our findings suggest that more cautious interpretation is needed, given the high confidence levels observed for incorrect answers. Xiong et al’s [17] work on LLM confidence elicitation aligns with our observations of overconfidence.

Mahmud Omar, Reem Agbareia, Benjamin S Glicksberg, Girish N Nadkarni, Eyal Klang

JMIR Med Inform 2025;13:e66917

Investigating Social Network Peer Effects on HIV Care Engagement Using a Fuzzy-Like Matching Approach: Cross-Sectional Secondary Analysis of the N2 Cohort Study

Within the network autocorrelation model [39,44-46], we specified the network effects model, defined by a formula in which y is a vector of values for our outcome variable (ie, engagement in status-neutral HIV-related care), and X is a matrix of values for the N actors.

Cho-Hee Shrader, Dustin T Duncan, Redd Driver, Juan G Arroyo-Flores, Makella S Coudray, Raymond Moody, Yen-Tyng Chen, Britt Skaathun, Lindsay Young, Natascha del Vecchio, Kayo Fujimoto, Justin R Knox, Mariano Kanamori, John A Schneider

JMIR Public Health Surveill 2025;11:e64497

The Prevalence and Incidence of Suicidal Thoughts and Behavior in a Smartphone-Delivered Treatment Trial for Body Dysmorphic Disorder: Cohort Study

Multivariate models included the following baseline variables: birth sex (female: n=67, 83.8%), age (mean 27, SD 9.64), C-SSRS lifetime suicidal ideation severity (mean 1.83, SD 1.86, median 1, IQR 3.25), lifetime suicide attempt (n=8, 10%), QIDS-SR depression severity total score, excluding the death- or suicide-related thoughts item #12 (mean 10.94, SD 3.90), BDD symptom severity (BDD Y-BOCS, mean 30.35, SD 4.4), and sexual orientation.

Adam C Jaroszewski, Natasha Bailen, Simay I Ipek, Jennifer L Greenberg, Susanne S Hoeppner, Hilary Weingarden, Ivar Snorrason, Sabine Wilhelm

JMIR Ment Health 2025;12:e63605