I'm guessing it's mostly just regular OCR, but uses tabs if there's more than a single space, although the file I got back did have merged cells with a dozen spaces for some of the rows, suggesting buggy overcomplicated logic.
We need to set Stallman on them all.