DEVLIN J , CHANG M W , LEE K , et al . BERT: pre-training of deep bidirectional transformers for language understanding [J ] . arXiv preprint , arXiv: 1810.04805 , 2018 .
HARRER S . Attention is not all you need: the complicated case of ethically using large language models in healthcare and medicine [J ] . eBioMedicine , 2023 , 90 : 104512 .
PINTO G , CARDOSO-PEREIRA I , MONTEIRO D , et al . Large language models for education: grading open-ended questions using ChatGPT [J ] . arXiv preprint , arXiv: 2307.16696 , 2023 .