Skip to content

Unsupported DOC file content type (application/msword) #1220

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
amitp-gc opened this issue Apr 27, 2025 · 3 comments
Open

Unsupported DOC file content type (application/msword) #1220

amitp-gc opened this issue Apr 27, 2025 · 3 comments

Comments

@amitp-gc
Copy link

amitp-gc commented Apr 27, 2025

issue description

What happened?
When attempting to convert a Microsoft Word (.doc) file to Markdown, markitdown fails with an UnsupportedFormatException indicating that no converter recognized the stream’s format:

`INFO - File download completed in 0.85s. Content-Type: application/msword, Content-Length: 102912

markitdown._exceptions.UnsupportedFormatException: Could not convert stream to Markdown. No converter attempted a conversion, suggesting that the filetype is simply not supported.

ValueError: Failed to convert file: Could not convert stream to Markdown. No converter attempted a conversion, suggesting that the filetype is simply not supported.`

@gabrielrdrguez
Copy link

I think it's just not supported, but it would be nice if it were

@orasik
Copy link

orasik commented May 9, 2025

Had the same issue, file type application/msword

@hu-qi
Copy link

hu-qi commented May 12, 2025

the same.

python 3.11.11
mac os 15.4.1 (apple m4)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants