[prototype] object det replacement / init contrib modules #1534

felixdittrich92 · 2024-03-28T11:13:51Z

NOTE: Only a dirty first draft for discussions :)

The idea behind a contrib module:

easy to extend with further functionalites like: doctr++ | UVDoc / table Transformer / super resolution / etc. for example
3 main requirements:
-- modules needs to work with the DocumentBuilder output structure (List[np.ndarray])
-- framework independent (no torch or tf allowed)
-- if model loading is required --> onnxruntime

Further advantages on our side:

Less code to maintain
Full focus on OCR quality
Maybe also a cool new place for contributors without the need to step deep into doctr's core

A first service for object detection which replaces the current (not really maintained and robust) one 😅

trained with yolov8 (artefact detection dataset) --> training outsourced to ultralytics -> easy to replace with any yolov8 (no OBB atm) trained model (exported to onnx)

example:

import os

from doctr.contrib import ArtefactDetector
from doctr.io import DocumentFile

root = "/home/felix/Desktop/doctr_test_data"

doc = DocumentFile.from_images([os.path.join(root, "6.jpg"), os.path.join(root, "7.jpg"), "/home/felix/Desktop/5a89cd6d989803e2.jpg"])

detector = ArtefactDetector(batch_size=2, conf_threshold=0.9, iou_threshold=0.5)

res = detector(doc)
detector.show()
print(res)

out:

[[], [], [{'label': 'photo', 'confidence': 0.9760028, 'box': [665, 194, 767, 319]}, {'label': 'logo', 'confidence': 0.97400737, 'box': [158, 745, 275, 862]}, {'label': 'photo', 'confidence': 0.9690048, 'box': [13, 790, 131, 933]}, {'label': 'bar_code', 'confidence': 0.94711626, 'box': [647, 425, 795, 469]}, {'label': 'logo', 'confidence': 0.94670904, 'box': [314, 9, 372, 67]}, {'label': 'qr_code', 'confidence': 0.94380593, 'box': [368, 691, 417, 740]}, {'label': 'bar_code', 'confidence': 0.9333756, 'box': [291, 813, 339, 850]}, {'label': 'bar_code', 'confidence': 0.93306136, 'box': [391, 108, 683, 151]}]]

visualized:

TODOS would be:

refactor code (each contrib module should inerhit from base.py - BasePredictor (call, (_process _preprocess needs every contrib module to implement))
docs
tests

felixdittrich92 · 2024-03-28T11:14:10Z

@odulcy-mindee wdyt about the idea ?

doctr/contrib/artefacts.py

codecov · 2024-04-02T14:54:22Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 95.84%. Comparing base (f21ac32) to head (d746593).
Report is 2 commits behind head on main.

❗ Current head d746593 differs from pull request most recent head 4800e59. Consider uploading reports for the commit 4800e59 to get more accurate results

Additional details and impacted files

@@             Coverage Diff             @@
##             main    #1534       +/-   ##
===========================================
+ Coverage   57.70%   95.84%   +38.14%     
===========================================
  Files         167      163        -4     
  Lines        7696     7705        +9     
===========================================
+ Hits         4441     7385     +2944     
+ Misses       3255      320     -2935

Flag	Coverage Δ
unittests	`95.84% <100.00%> (+38.14%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

felixdittrich92 · 2024-04-03T06:06:57Z

About the Artefact-DocumentBuilder element i'm not sure should we keep it for maybe later usage ? (As a kind of "add to the OCR results") Or should we remove it (currently always empty and unused) ?
In general it makes more sense to encapsulate the object detection (and other possible pre-proc modules as mentioned) from the OCR pipeline to interact directly with the DocumentFile output.
As an example: Now the object detection pre step could be used for example to mask some detected parts of the images before it's passed to the OCR pipeline to ignore irrelevant fields like images which contains text.

.github/workflows/references.yml

docs/source/using_doctr/using_contrib_modules.rst

doctr/contrib/__init__.py

doctr/contrib/base.py

tests/common/test_contrib.py

doctr/contrib/base.py

odulcy-mindee · 2024-04-18T13:44:09Z

In general it makes more sense to encapsulate the object detection (and other possible pre-proc modules as mentioned) from the OCR pipeline to interact directly with the DocumentFile output.

So, purpose of contrib is to always wrap a model and doing some predictions ? Or can it be only some postprocessing function ?

felixdittrich92 · 2024-04-18T14:13:08Z

In general it makes more sense to encapsulate the object detection (and other possible pre-proc modules as mentioned) from the OCR pipeline to interact directly with the DocumentFile output.

So, purpose of contrib is to always wrap a model and doing some predictions ? Or can it be only some postprocessing function ?

The idea was to make it compatible with the DocumentFile output so in this case it's only preprocessing.
And yes with the actual state each contrib module would need a onnx model do you have something in mind where it would make sense to have something without a model ? (It's a prototype so open for every idea :D)

odulcy-mindee · 2024-04-23T14:21:24Z

The idea was to make it compatible with the DocumentFile output so in this case it's only preprocessing.
And yes with the actual state each contrib module would need a onnx model do you have something in mind where it would make sense to have something without a model ? (It's a prototype so open for every idea :D)

Ok, for a first version, model is required. Code can be improved later to set onxxmodel as optional.

odulcy-mindee · 2024-04-23T14:24:19Z

pyproject.toml

@@ -155,6 +158,7 @@ module = [
 	"anyascii.*",
 	"tensorflow.*",
 	"torchvision.*",
+    "onnxruntime.*",


not aligned

that's a github view bug see:

felixdittrich92 · 2024-04-23T14:46:01Z

The idea was to make it compatible with the DocumentFile output so in this case it's only preprocessing.
And yes with the actual state each contrib module would need a onnx model do you have something in mind where it would make sense to have something without a model ? (It's a prototype so open for every idea :D)

Ok, for a first version, model is required. Code can be improved later to set onxxmodel as optional.

Correct :)

odulcy-mindee

model uploaded

doctr/contrib/artefacts.py

Co-authored-by: Olivier Dulcy <[email protected]>

felixdittrich92 requested a review from odulcy-mindee March 28, 2024 11:13

felixdittrich92 changed the title ~~[prototype] object det replacement / init contrib modules~~ [DRAFT][prototype] object det replacement / init contrib modules Mar 28, 2024

felixdittrich92 commented Apr 2, 2024

View reviewed changes

doctr/contrib/artefacts.py Show resolved Hide resolved

felixdittrich92 force-pushed the obj-prototype branch from 42ad57a to 2a55e41 Compare April 2, 2024 13:28

felixdittrich92 requested a review from frgfm April 13, 2024 14:57

felixdittrich92 marked this pull request as ready for review April 15, 2024 17:53

felixdittrich92 changed the title ~~[DRAFT][prototype] object det replacement / init contrib modules~~ [prototype] object det replacement / init contrib modules Apr 15, 2024

felixdittrich92 marked this pull request as draft April 15, 2024 18:15

odulcy-mindee reviewed Apr 18, 2024

View reviewed changes

felixdittrich92 force-pushed the obj-prototype branch from 5c03308 to 240bab2 Compare April 21, 2024 13:17

felixdittrich92 added 8 commits April 23, 2024 06:33

up

01d47ab

first draft - non working

3f7242a

up

12c021c

up

20de358

update

aee6b2a

up

0016a7a

update and add doc

023ee5e

update doc

284439d

felixdittrich92 added 11 commits April 23, 2024 06:33

mypy

16319e3

mypy + doc

eae4d7d

revert ci change

255e323

revert ci change

ea6e2e4

remove obj det from hub tests

e767cd4

update

e2ecd8f

init cover

1634d00

update

a50c1e2

apply suggestions

9872e15

small update

7258481

rebase and add requires

cd03646

felixdittrich92 force-pushed the obj-prototype branch from 483ff32 to cd03646 Compare April 23, 2024 04:43

felixdittrich92 added 2 commits April 23, 2024 07:18

update

6340a40

update

d746593

felixdittrich92 marked this pull request as ready for review April 23, 2024 05:59

odulcy-mindee reviewed Apr 23, 2024

View reviewed changes

odulcy-mindee reviewed Apr 25, 2024

View reviewed changes

doctr/contrib/artefacts.py Outdated Show resolved Hide resolved

Update doctr/contrib/artefacts.py

4800e59

Co-authored-by: Olivier Dulcy <[email protected]>

odulcy-mindee merged commit 630d925 into mindee:main Apr 25, 2024
70 of 78 checks passed

felixdittrich92 mentioned this pull request Apr 26, 2024

[references] remove missed parts of old obj det #1568

Merged

felixdittrich92 deleted the obj-prototype branch August 30, 2024 11:31

[prototype] object det replacement / init contrib modules #1534

[prototype] object det replacement / init contrib modules #1534

Uh oh!

Conversation

felixdittrich92 commented Mar 28, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

felixdittrich92 commented Mar 28, 2024

Uh oh!

Uh oh!

codecov bot commented Apr 2, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

felixdittrich92 commented Apr 3, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

odulcy-mindee commented Apr 18, 2024

Uh oh!

felixdittrich92 commented Apr 18, 2024

Uh oh!

odulcy-mindee commented Apr 23, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

odulcy-mindee Apr 23, 2024

Choose a reason for hiding this comment

Uh oh!

felixdittrich92 Apr 24, 2024

Choose a reason for hiding this comment

Uh oh!

felixdittrich92 commented Apr 23, 2024

Uh oh!

odulcy-mindee left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

felixdittrich92 commented Mar 28, 2024 •

edited

Loading

codecov bot commented Apr 2, 2024 •

edited

Loading

odulcy-mindee commented Apr 23, 2024 •

edited

Loading