Skip to content

Releases: VikParuchuri/marker

v1.5.3

19 Feb 03:43
27d2b9e
Compare
Choose a tag to compare

Windows fixes

  • Fix issue with streamlit app and permissions
  • Fix torch classes issue

Memory leak fix

Fixed memory leak with repeatedly reusing the same converter.

Convert.py enhancements

  • disable tqdm bars when converting multiple files

What's Changed

New Contributors

Full Changelog: v1.5.2...v1.5.3

Fix LLM service issue

14 Feb 01:49
0af86b1
Compare
Choose a tag to compare

Fix issue with initializing the LLM service with no default specified.

Fix OCR issue

14 Feb 01:06
9721c17
Compare
Choose a tag to compare

Fix issue with OCRing documents with a mix of good and bad pages.

Inline math; speed up LLM calls; allow local models

13 Feb 21:09
b87d088
Compare
Choose a tag to compare

Inline math

Marker will handle inline math if --use_llm is set. This makes reading scientific papers a lot nicer! The feature has been optimized for speed.

image

Local LLMs

We now support Ollama - when you're passing the --use_llm flag, you can select the Ollama inference service like this:

marker_single FILEPATH --use_llm --llm_service marker.services.ollama.OllamaService

You can set the options --ollama_base_url and --ollama_model. By default, it will use llama3.2-vision.

Batch LLM calls

LLM calls are now batched across processors for a significant speedup if you're passing --use_llm.

Misc fixes

  • Biology PDFs now work a lot better - leading line numbers are stripped
  • Improved OCR heuristics
  • Updated the examples

What's Changed

Full Changelog: v1.4.0...v1.5.0

LLM fixes; new benchmarks

11 Feb 16:41
a357967
Compare
Choose a tag to compare

New benchmarks

Overall

Benchmark against llamaparse, docling, mathpix (see README for how to run benchmarks). Marker performs favorably against alternatives in speed, llm as judge scoring, and heuristic scoring.

image

Table

Benchmark tables against gemini flash:

image

Update gemini model

  • Use the new genai library
  • Update to gemini flash 2.0

Misc bugfixes

  • Fix bug with OCR heuristics not being aggressive enough
  • Fix bug with empty tables
  • Ensure references get passed through in llm processors

What's Changed

Full Changelog: v1.3.5...v1.4.0

Bump gemini version

06 Feb 02:08
0aa40a5
Compare
Choose a tag to compare

When using the optional llm mode, there appears to be a bug with gemini flash 1.5. This release bumps the version to gemini flash 2.0, which appears to resolve the bug.

Fix pytorch bug

31 Jan 03:00
dba5b4c
Compare
Choose a tag to compare

There was a bug with pytorch 2.6 and MPS that caused errors in inference - this has been fixed.

New LaTeX OCR model; block visualizer; better links/references

29 Jan 16:43
9c740b1
Compare
Choose a tag to compare

Improved LaTeX OCR

We trained a new LaTeX OCR model that works a lot better overall. It will reliably output KaTeX-compatible math. It also operates on longer sequences than before.

The rendered output is on the right, original document on the left:

image

Block visualization

You can now visualize blocks in the streamlit app, thanks to @jazzido . By selecting json output and checking "show blocks", you get a nice visualization where you can see how marker parsed the page. Clicking on blocks will show the HTML.

image

Links and references

We fixed a bug with links and references, they now render as one block. You can see the extracted references here:

image

Misc bugfixes

  • Fixed some bugs with tables and row splitting
  • Escaped $ inside text and tables so we don't accidentally render things as equations

What's Changed

New Contributors

Full Changelog: v1.3.2...v1.3.3

Fix table bugs

27 Jan 16:27
228a7ba
Compare
Choose a tag to compare
  • Issue where some blocks were hidden when they were around tables
  • Fix span id issue with --use_llm and tables
  • Fix problem with tables not OCRing when needed

Improved equations, bugfixes

24 Jan 18:11
9ed906d
Compare
Choose a tag to compare
  • Equations in tables now render properly with --use_llm
  • Fix how block equations render
  • Fix bug with markdown table rendering and --use_llm
  • Fix bug with convert.py CLI script