As a starting place, here are initial recommendations of questions to be asked when considering the use of an LLM as part of your scientific research workflow.
- When deploying an LLM in your research do you have systems in place for documenting its use?
- Will students or others also be using LLMs in their contributions, and have you discussed with them the norms/standards for documenting their use?
- Will only predefined prompts be used, or will impromptu prompts be added? How will those decisions be made and documented?
- How will data generated by LLMs be labeled, cataloged, stored, and managed?