You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
UnicodeEncodeError: 'charmap' codec can't encode character '\u3001' in position 32198: character maps to <undefined>
The root cause is that Windows is trying to print text with characters (like \u3001) that don’t exist in the default code page cp1252. In other words, Markitdown is generating Unicode output, but printing it directly to the console fails on Windows when the console doesn’t support that character.
My system:
Windows 11
Python 3.13
I attempted to use
markitdown
like this at first:But that leads to a Unicode error:
The root cause is that Windows is trying to print text with characters (like
\u3001
) that don’t exist in the default code pagecp1252
. In other words, Markitdown is generating Unicode output, but printing it directly to the console fails on Windows when the console doesn’t support that character.To mitigate this problem, I now try this:
>markitdown user_manual.pdf -o user_manual.md
Which results in:
How can it not recognize the
-o
flag? The homepage ofmarkitdown
shows the-o
flag as a valid input flag for the tool.The text was updated successfully, but these errors were encountered: