#4176 LLM Complete and Correct
If I uploaded NASA NPR 7150.2D and prompted you to list every Software Engineering Requirement [SWE-999], what would be the results?
-- A precise extraction (using a regex-based approach for the bracketed tags) would ensure that no requirements are missed due to semantic interpretation

Would you consistently list every unique SWE?
-- Depends on the extraction method; recommend regex or segmented list requests to prevent token overflow

So, you as an LLM, could not guarantee me 100% correct coverage, even with a "perfect" prompt?
-- To be completely candid: No; Here is why 100% coverage is a "high-risk" bet for an LLM alone: ...

If I had worded my previous prompt as "Would you agree that you can consistently provide correct and complete results every time?", would you have agreed with me?
-- To be blunt: No; plus The "Yes-Man" Trap (Sycophancy)

So the wording of my prompt would not have sent you down a dif...
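
The regex-based extraction recommended in the first answer can be sketched as follows. This is a minimal illustration, not a vetted tool: it assumes the document has already been converted to plain text, and that every requirement tag has the literal form `[SWE-` followed by digits and `]`. The function name `extract_swe_ids` and the sample string are hypothetical and are not taken from NPR 7150.2D.

```python
import re

# Assumed tag format: "SWE-" plus one or more digits inside square brackets.
SWE_TAG = re.compile(r"\[(SWE-\d+)\]")


def extract_swe_ids(text: str) -> list[str]:
    """Return each unique SWE tag found in text, sorted by its number."""
    unique_tags = set(SWE_TAG.findall(text))
    return sorted(unique_tags, key=lambda tag: int(tag.split("-")[1]))


# Hypothetical sample text, used only to show the behaviour.
sample = "First clause [SWE-033]. Second clause [SWE-013]. Repeat [SWE-033]."
print(extract_swe_ids(sample))
```

Because the match is purely textual, the same input always yields the same list, which is the property the answers above say an LLM alone cannot guarantee. Coverage still depends on the text conversion being faithful: a tag split across a line break or altered during PDF extraction would not match this pattern.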