A framework for longitudinal health AI agents

  • Nedos, I. et al. Is artificial intelligence ready for emergency department triage? a retrospective evaluation of multiple large language models in 39,375 patients at a university emergency department. J. Clin. Med. 15, 1512 (2026).

  • Gaber, F. et al. Evaluating large language model workflows in clinical decision support for triage and referral and diagnosis. NPJ Digit. Med. 8, 263 (2025).

  • Xie, Q. et al. Medical foundation large language models for comprehensive text analysis and beyond. NPJ Digit. Med. 8, 141 (2025).

  • Biswas, A. & Talukdar, W. Intelligent clinical documentation: harnessing generative AI for patient-centric clinical note generation. Int. J. Innov. Sci. Res. Technol. https://doi.org/10.38124/ijisrt/IJISRT24MAY1483 (2024).

  • Maity, S. & Saikia, M. J. Large language models in healthcare and medical applications: a review. Bioengineering 12, 631 (2025).

  • Aydin, S., Karabacak, M., Vlachos, V. & Margetis, K. Large language models in patient education: a scoping review of applications in medicine. Front. Med. 11, 1477898 (2024).

  • Lin, C. & Kuo, C. -F. Roles and potential of large language models in healthcare: a comprehensive review. Biomed. J. 48, 100868 (2025).

  • Jörke, M. et al. GPTCoach: towards LLM-based physical activity coaching. In Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems https://doi.org/10.1145/3706598.3713819 (2024).

  • Ong, Q. C. et al. Advancing health coaching: a comparative study of large language model and health coaches. Artif. Intell. Med. 157, 103004 (2024).

  • Schulman-Green, D. et al. Processes of self-management in chronic illness. J. Nurs. Scholarsh. 44, 136–144 (2012).

  • Peerbolte, T. F. et al. Conversational agents supporting self-management in people with a chronic disease: systematic review. J. Med. Internet Res. 27, e72309 (2025).


  • Serugunda, H. M. et al. Using large language models for chronic disease management tasks: scoping review. JMIR Med. Inform. 13, e66905 (2025).

  • Shayaninasab, M., Zahoor, M. & Yalçin, Ö. N. Enhancing patient intake process in mental health consultations using RAG-driven chatbot. In 2024 12th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos (ACIIW), 256–264 https://doi.org/10.1109/ACIIW63320.2024.00053 (IEEE, 2024).

  • Ayers, J. W. et al. Comparing physician and artificial intelligence chatbot responses to patient questions posted to a public social media forum. JAMA Intern. Med. 183, 589–596 (2023).

  • Haag, D. et al. The last JITAI? exploring large language models for issuing just-in-time adaptive interventions: fostering physical activity in a prospective cardiac rehabilitation setting. In Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems, 1–18 https://doi.org/10.1145/3706598.3713307 (2024).

  • Artsi, Y. et al. Large language models in real-world clinical workflows: a systematic review of applications and implementation. Front. Digit. Health 7, 1659134 (2025).

  • Farzan, M., Ebrahimi, H., Pourali, M. & Sabeti, F. Artificial intelligence-powered cognitive behavioural therapy chatbots, a systematic review. Iran. J. Psychiatry 20, 102–110 (2025).

  • Wang, J. et al. Psychological counseling cannot be achieved overnight: automated psychological counseling through multi-session conversations. Preprint at https://arxiv.org/abs/2506.06626 (2025).

  • McFadyen, J. et al. Increasing engagement with cognitive-behavioral therapy (CBT) using generative AI: a randomized controlled trial (RCT). Commun. Med. 6, 129 (2026).

  • Sinha, C., Thakkar, R., Meheli, S. & Dinesh, D. Exploring the role of app features in providing continuity of care to users on a digital mental health platform (Wysa): Retrospective mixed methods observational study. JMIR Form. Res. 10, e73033 (2026).

  • Zhang, C. et al. A survey on multi-turn interaction capabilities of large language models. Preprint at https://arxiv.org/abs/2501.09959 (2025).

  • Uijen, A. A., Schers, H. J., Schellevis, F. G. & van den Bosch, W. J. How unique is continuity of care? a review of continuity and related concepts. Fam. Pract. 29, 264–271 (2012).

  • Saultz, J. W. & Lochner, J. Interpersonal continuity of care and care outcomes: a critical review. Ann. Fam. Med. 3, 159–166 (2005).

  • Gray, D. J. P., Sidaway-Lee, K., White, E., Thorne, A. & Evans, P. H. Continuity of care with doctors—a matter of life and death? A systematic review of continuity of care and mortality. BMJ Open 8, e021161 (2018).

  • Van Walraven, C., Oake, N., Jennings, A. & Forster, A. J. The association between continuity of care and outcomes: a systematic and critical review. J. Eval. Clin. Pract. 16, 947–956 (2010).

  • Zhang, T. et al. History-aware hierarchical transformer for multi-session open-domain dialogue system. In Findings of the Association for Computational Linguistics: EMNLP 2022, 3395–3407 https://doi.org/10.18653/v1/2022.findings-emnlp.247 (2022).

  • Maharana, A. et al. Evaluating very long-term conversational memory of LLM agents. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 13851–13870 https://doi.org/10.18653/v1/2024.acl-long.747 (2024).

  • Ge, Y. et al. TReMu: towards neuro-symbolic temporal reasoning for LLM-agents with memory in multi-session dialogues. In Findings of the Association for Computational Linguistics: ACL 2025, 18974–18988 https://doi.org/10.18653/v1/2025.findings-acl.972 (2025).

  • Reynolds, R. et al. A systematic review of chronic disease management interventions in primary care. BMC Fam. Pract. 19, 11 (2018).

  • Jones, D., Dunn, L., Watt, I. & Macleod, U. Safety netting for primary care: evidence from a literature review. Br. J. Gen. Pract. 69, e70–e79 (2019).

  • Callen, J. L., Westbrook, J. I., Georgiou, A. & Li, J. Failure to follow-up test results for ambulatory patients: a systematic review. J. Gen. Intern. Med. 27, 1334–1348 (2011).

  • Rothman, A. A. & Wagner, E. H. Chronic illness management: what is the role of primary care? Ann. Intern. Med. 138, 256–261 (2003).

  • Almond, S., Mant, D. & Thompson, M. Diagnostic safety-netting. Br. J. Gen. Pract. 59, 872–874 (2009).

  • Li, I., Dey, A. & Forlizzi, J. A stage-based model of personal informatics systems. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI ’10), 557–566 https://doi.org/10.1145/1753326.1753409 (2010).

  • Nahum-Shani, I., Hekler, E. B. & Spruijt-Metz, D. Building health behavior models to guide the development of just-in-time adaptive interventions: a pragmatic framework. Health Psychol. 34, 1209–1219 (2015).

  • Hsu, T. -C. C. et al. Personalized interventions for behaviour change: a scoping review of just-in-time adaptive interventions. Br. J. Health Psychol. 30, e12766 (2024).

  • Bosschaerts, K. et al. Designing a just-in-time adaptive intervention with trigger detection and a generative chatbot: smoking cessation use case. Digit. Health https://doi.org/10.1177/20552076251381747 (2025).

  • Lu, T., Lin, Q., Yu, B. & Hu, J. A systematic review of strategies in digital technologies for motivating adherence to chronic illness self-care. NPJ Health Syst. 2, 13 (2025).

  • Chen, C. et al. Followupbot: an LLM-based conversational robot for automatic postoperative follow-up. In International Conference on Behavioural and Social Computing 252–260 (Springer Nature Singapore, 2025).

  • Mamykina, L., Smaldone, A. M. & Bakken, S. R. Adopting the sensemaking perspective for chronic disease self-management. J. Biomed. Inform. 56, 406–417 (2015).

  • Lin, G., Le, M. N., Truong, K. N. & Mariakakis, A. The cognitive strategies behind multimodal health sensemaking: a menstrual health tracking case study. In Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies https://doi.org/10.1145/3749482 (2025).

  • Mulani, J. et al. Deep reinforcement learning based personalized health recommendations. In Deep Learning Techniques for Biomedical and Health Informatics, 231–255 (Springer, 2019).

  • Abbasian, M., Azimi, I., Rahmani, A. M. & Jain, R. Conversational health agents: a personalized large language model-powered agent framework. JAMIA Open 8, ooaf067 (2025).

  • Su, J. et al. Investigating the factors influencing users’ adoption of artificial intelligence health assistants based on an extended UTAUT model. Sci. Rep. 15, 18215 (2025).

  • Afroogh, S., Akbari, A., Malone, E., Kargar, M. & Alambeigi, H. Trust in AI: progress, challenges, and future directions. Humanit. Soc. Sci. Commun. 11, 1568 (2024).

  • Sivaraman, V., Bukowski, L. A., Levin, J., Kahn, J. M. & Perer, A. Ignore, trust, or negotiate: understanding clinician acceptance of AI-based treatment recommendations in health care. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 1–18 (2023).

  • Mick, I., Freger, S. M., van Keizerswaard, J., Gholiof, M. & Leonardi, M. Comprehensive endometriosis care: a modern multimodal approach for the treatment of pelvic pain and endometriosis. Ther. Adv. Reprod. Health 18, 26334941241277759 (2024).

  • Becker, C. M., Gattrell, W. T., Gude, K. & Singh, S. S. Reevaluating response and failure of medical treatment of endometriosis: a systematic review. Fertil. Steril. 108, 125–136 (2017).

  • Devan, H., Hale, L., Hempel, D., Saipe, B. & Perry, M. A. What works and does not work in a self-management intervention for people with chronic pain? Qualitative systematic review and meta-synthesis. Phys. Ther. 98, 381–397 (2018).

  • Edgley, K., Horne, A. W., Saunders, P. T. K. & Tsanas, A. Symptom tracking in endometriosis using digital technologies: knowns, unknowns, and future prospects. Cell Rep. Med. 4, 101192 (2023).

  • Trepanier, L. C. M. et al. Smartphone apps for menstrual pain and symptom management: a scoping review. Internet Interv. 31, 100605 (2023).

  • Requadt, E., Nahlik, A. J., Jacobsen, A. & Ross, W. T. Patient experiences of endometriosis diagnosis: a mixed methods approach. BJOG 131, 941–951 (2024).

  • Gracia, E. et al. The vulnerable phase of heart failure. Am. J. Ther. 25, e456–e464 (2018).

  • Greene, S. J. et al. The vulnerable phase after hospitalization for heart failure. Nat. Rev. Cardiol. 12, 220–229 (2015).

  • Regalbuto, R., Maurer, M. S., Chapel, D., Mendez, J. & Shaffer, J. A. Joint commission requirements for discharge instructions in patients with heart failure: is understanding important for preventing readmissions? J. Card. Fail. 20, 641–649 (2014).

  • Heidenreich, P. A. et al. 2022 AHA/ACC/HFSA Guideline for the Management of Heart Failure: a report of the American College of Cardiology/American Heart Association Joint Committee on clinical practice guidelines. Circulation 145, e895–e1032 (2022).

  • Weiss, A. J. & Jiang, H. J. Overview of clinical conditions with frequent and costly hospital readmissions by payer, 2018. In Healthcare Cost and Utilization Project (HCUP) Statistical Brief #278 (Agency for Healthcare Research and Quality, 2021).

  • Lee, K. K., Yang, J., Hernandez, A. F., Steimle, A. E. & Go, A. S. Post-discharge follow-up characteristics associated with 30-day readmission after heart failure hospitalization. Med. Care 54, 365–372 (2016).

  • Tung, Y. -C., Chang, G. -M., Chang, H. -Y. & Yu, T. -H. Relationship between early physician follow-up and 30-day readmission after acute myocardial infarction and heart failure. PLoS ONE 12, e0170061 (2017).

  • Lainscak, M. et al. Self-care management of heart failure: practical recommendations from the Patient Care Committee of the Heart Failure Association of the European Society of Cardiology. Eur. J. Heart Fail. 13, 115–126 (2011).

  • Balaskas, A., Schueller, S. M., Cox, A. L. & Doherty, G. Ecological momentary interventions for mental health: a scoping review. PLoS ONE 16, e0248152 (2021).

  • Torous, J. et al. The growing field of digital psychiatry: current evidence and the future of apps, social media, chatbots, and virtual reality. World Psychiatry 20, 318–335 (2021).

  • Haaker, J. et al. Deficient inhibitory processing in trait anxiety: Evidence from context-dependent fear learning, extinction recall and renewal. Biol. Psychol. 111, 65–72 (2015).

  • Hindmarch, T., Hotopf, M. & Owen, G. S. Depression and decision-making capacity for treatment or research: a systematic review. BMC Med. Ethics 14, 54 (2013).

  • Si, Y. et al. Quality, safety and disparity of an AI chatbot in managing chronic diseases: simulated patient experiments. NPJ Digit. Med. 8, 574 (2025).

  • Yu, C. et al. From passive to proactive: a multi-agent system with dynamic task orchestration for intelligent medical pre-consultation. Preprint at https://arxiv.org/abs/2511.01445 (2025).

  • Wu, D. et al. LongMemEval: benchmarking chat assistants on long-term interactive memory. In Proceedings of the International Conference on Learning Representations (2025).

  • Noah, B. et al. Impact of remote patient monitoring on clinical outcomes: an updated meta-analysis of randomized controlled trials. NPJ Digit. Med. 1, 20172 (2018).

  • Hamine, S., Gerth-Guyette, E., Faulx, D., Green, B. B. & Ginsburg, A. S. Impact of mHealth chronic disease management on treatment adherence and patient outcomes: a systematic review. J. Med. Internet Res. 17, e52 (2015).

  • Vegesna, A., Tran, M., Angelaccio, M. & Arcona, S. Remote patient monitoring via non-invasive digital technologies: a systematic review. Telemed. J. E Health 23, 3–17 (2017).

  • Smedslund, G., Osteras, N. & Hestevik, C. H. Effects of remote patient monitoring on health care utilization in patients with noncommunicable diseases: systematic review and meta-analysis. JMIR Mhealth Uhealth 13, e68464 (2025).

  • Merrill, M. A. et al. Transforming wearable data into personal health insights using large language model agents. Nat. Commun. 17, 1143 (2026).

  • Mamykina, L. et al. Personal discovery in diabetes self-management: discovering cause and effect using self-monitoring data. J. Biomed. Inform. 76, 1–8 (2017).
