Machine Learning and Teaching

I just responded to an unsolicited email from a consultant working for Pearson publishing – perhaps you received one too. The sender was requesting my participation in the following scheme:

They provide five essay questions that I can assign to my students. My students enter the essays through an online portal. The essays will then be graded by “subject area experts” and the grades and comments will be returned to me – I am free to pass these on to students or use as I like. For my trouble: “you would have a couple of essays graded for you. Also, Pearson will pay you $100.” 

They will use the students’ work to “build the bank of student essays needed to develop the product.” The product is a “computer-assisted grading program that will support you and your students when assigning short writing assignments.”

What they are up to, one suspects, is developing a training corpus for machine learning algorithms.  It’s a relatively straight-forward classification problem.  They don’t need to figure out what makes a good answer to a given essay question – if they have enough human evaluated examples, they can train the machine to do just as well as the humans.  Just as well, that is, as the “subject area experts” they hire.

In my email response to the consultant I raised a different question: how much are they planning to compensate the students whose copyrighted intellectual property they are asking me to facilitate them obtaining for the development of a commercial product. I asked what advice their lawyers had given them regarding the commercial use of material that students are compelled to produce and submit as a requirement of a class.

Would you require your students to send their work to Pearson?  Would you accept payment for doing so?   Even if this is considered fair use under copyright law*, should institutions and instructors be in the business of building up Pearson’s content for a product that Pearson will then turn around and sell back to us?

Personally, I say no thanks. Seems to me just one more step toward making colleges mere franchises and store fronts for educational publishers. It’s too bad we are not collectively producing tools like this for the public benefit rather than being coopted into contributing to the progressive privatization of pedagogy.

And I think I’ll start recommending that my students consider appending a CC BY-NC 4.0 license to work they are willing to share.

*A similar question has arisen in connection with Turn-It-In a service that checks for plagiarism. That company has prevailed so far in lawsuits that claim it makes illegal use of copyrighted student material.

See Also