Discussion about this post

User's avatar
Christa Albrecht-Crane's avatar

I've read this post with great interest. I am writing a paper right now about NotebookLM and its limitations. This work here is very insightful. Of course, ideally, you would find human raters who evaluate the output for gender bias and then compare their ratings to understand better what sort of bias is present in the training data. The more I learn about the built-in error rates of statistical language models AND bias in the training data, the more I realize that rigorous and trustworthy work cannot be outsourced to them. As you say in the last paragraph, we need to create habits of mind and work that are always suspicious of the hyperbolic claims made by "AI" marketing.

Expand full comment
1 more comment...

No posts