or
or
By clicking below, you agree to our terms of service.
New to HackMD? Sign up
Syntax | Example | Reference | |
---|---|---|---|
# Header | Header | 基本排版 | |
- Unordered List |
|
||
1. Ordered List |
|
||
- [ ] Todo List |
|
||
> Blockquote | Blockquote |
||
**Bold font** | Bold font | ||
*Italics font* | Italics font | ||
~~Strikethrough~~ | |||
19^th^ | 19th | ||
H~2~O | H2O | ||
++Inserted text++ | Inserted text | ||
==Marked text== | Marked text | ||
[link text](https:// "title") | Link | ||
 | Image | ||
`Code` | Code |
在筆記中貼入程式碼 | |
```javascript var i = 0; ``` |
|
||
:smile: | ![]() |
Emoji list | |
{%youtube youtube_id %} | Externals | ||
$L^aT_eX$ | LaTeX | ||
:::info This is a alert area. ::: |
This is a alert area. |
On a scale of 0-10, how likely is it that you would recommend HackMD to your friends, family or business associates?
Please give us some advice and help us improve HackMD.
Syncing
xxxxxxxxxx
This note moved to: https://pad.gwdg.de/75dyxG6gS-e0Q04_fpm-ng?view Please submit your suggestions there!
Open OCR-D-TechCall
General information
The open OCR-D-TechCall takes place every second Wednesday, 2 to 3 p.m. CET. Feel free to suggest important issues for discussion and vote for the issues which should be dealt with in the next call. Furthermore, each week one participant can talk about his (current) experiences and challenges with OCR-D. If you want to share your experiences in one of the next calls, just put your name on the list. On the Monday before a call, we will put the agenda together and send it to you.
Agenda for the next call
(numerical prefix represents subjective urgency. 1 = low, 2 = medium, 3 = high)
Agenda of the last call
.id
attributePostponed
Archive
Robert Sachunsky: Distributed Ground Truth transcription with Excel
Deprecating Python 3.6
Evaluation of cor-asv-ann postcorrection based on various OCR models and in-/out-domain data https://github.com/ASVLeipzig/cor-asv-ann
New ocrd_all release
ocrd-anybaseocr-block-segmentation
Voting on the defintion of CER :-) https://github.com/impactcentre/ocrevalUAtion/issues/21 https://github.com/roy-ht/editdistance/issues/28 https://github.com/roy-ht/editdistance/issues/38
https://github.com/kba/vdhd-2021-05-12 (btw also https://github.com/kba/vdhd-2021-05-05)
eynollah with OCR-D bindings released
Rewritten ocrd-anybaseocr-crop
https://github.com/kba/page-to-alto
<SP/>
and<HYP/>
https://github.com/kba/page-to-alto/issues/6https://github.com/kba/page-to-alto
eynollah with OCR-D bindings ready to test
Vahid Rezanezhad (@vahidrezanezhad) Eynollah - multi-model trainable segmentation and region classification
Kai Labusch (@labusch) neat - online NER entity annotation (and GT editing!) tool
Merlijn B. Wajer (MerlijnWajer) OCR @ Internet archive and script/lang detection
Dynamic model selection from metadata
Moving away from PIL? Where to?
Workflow Server Demo
Sunsetting Python 3.6
Mike Gerber: https://github.com/qurator-spk/ocrd-galley
ocrd-calamari-recognize
when venv is activeocrd process
Jochen Barth: https://github.com/jbarth-ubhd/blitzDrt
Uwe Hartwig: https://github.com/ulb-sachsen-anhalt/digital-derivans
Bernhard Liebl: https://github.com/poke1024/origami
Resource Management and ocrd_all
Performance
Automatically choosing models from language metadata
OCR alignment and diff view for browse-ocrd
Alternative PAGE-XML visualization CLI
Integrating origami into OCR-D?
End of support for Python 3.5 in OCR-D: March 1, 2021
New in 2.22.0
ocrd workspace rename-group
https://github.com/OCR-D/core/pull/655OcrdPage.get_AllAlternativeImages
https://github.com/OCR-D/core/pull/654Upcoming for 2.2x
Transcription with Excel
Resources (models, configurations) etc. for processor
ocrd_kraken rewrite for kraken 3.x
1 De-Keystoning / Page splitting (@hnesk via https://gitter.im/OCR-D/Lobby?at=5ee9ffc490cd6426c8116ab0 / @bertsky via https://gitter.im/OCR-D/Lobby?at=5ef482c2d65a3b0292ac4eb2)
Evaluation of calamari 0.3, 1.0 and tesseract in relation to segmentations (Mike Gerber)
Transitioning ocrd_all to slim containers
Efficiently working with preloaded models in processors
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More →sbb_binarize now in ocrd_all
Wanted: Complex but realist example pages for upcoming regression tests
OLA-HD client
browse-ocrd - what it is and how it is used in a scan workflow
resegment inefficiency: https://github.com/cisocrgroup/ocrd_cis/issues/73#issuecomment-708945511
clipping inefficiency: https://github.com/cisocrgroup/ocrd_cis/issues/74#issuecomment-708708
Transitioning ocrd_all to slim containers
OCR-D Web API - what it is and what it isn't -> Glossary of terms
Technical questions in context of grant proposals deadline
(Fixing https://github.com/OCR-D/ocrd_all/issues/195)
OLA-HD client
Logging Refactoring
Diagnostics for workflows
Revisiting the OCR-D Web API draft
benchmarking
API / OCR-D Butler
CUDA: https://github.com/OCR-D/core/pull/454 / https://github.com/OCR-D/ocrd_all/pull/178
New and upcoming features
ocrd-wf
evaluation of workflow and intermediate results
API
Evaluation tools for Layout Analysis
New in v2.13.0:
Processor.input_files
can handle PAGE-XML and images in the same folder -> let's adapt all processorsWeb APIs for OCR-D: Who is planning to build them and how can we coordinate?
New features in 2.11.0+
ocrd-sanitize-mets
: Tool zum bereinigen von OCR-Dmets.xml
-> https://github.com/OCR-D/core/issue/544Building Debian/Ubuntu packages for OCR-D -> https://github.com/OCR-D/ocrd_all/issues/130
Solution for maintaining GT in git (LFS, git-annex, …?)
evaluation of workflow and intermediate results
1 ocrd-website wiki
1 image preprocessing new with ocrd_wrap, including generic shell wrapper
1 deploying models and presets
2 Handling rotation in metadata
3 Order of cropping and rotation
2 PAGE API extensions, esp. automatic polygon sanitation
1 GPU-enabled docker base image https://github.com/OCR-D/core/pull/452 / https://github.com/OCR-D/core/pull/454
3 Getting rid of –mets-basename
2 behavior if processor is called w/o args
3 ocrd_tool: file parameter relative filename resolution order (Issue: #160)
3 parameter presets
3 AlternativeImage in output fileGrp, AlternativeImage in structMap
3 repeatable -p
3 global mets:structMap/mets:fptr
3 structMap[@TYPE=OCR-D-LOGICAL] / FULLDOWNLOAD
3 uniform parameters for common settings
1
sample-calls
for ocrd-tool.json1 Building
deb
packages for OCR-D3 Managing OCR models
Evaluation of the OCR-D-processors on representative GT
documenting workflows (Konstantin)
checking and producing valid coordinates (Robert)
recursive/2nd-level regions (Robert)
derived images under same fileGrp (Robert)
Tensorflow 1.15 vs 2.0.0 vs 2.1.0
ignore PrintSpace (Robert)v2.7.03 ocrd-tool.json: structure of input/output groupClosed3 bashlib: check fo 4.4.+v2.8.23 processor –overwritev2.10.03 XSD validationv2.9.03 check file exists before ocrd workspace addv2.8.23 filename convention for bulk addv2.10.02 bashlib loggingv2.10.02 validate pcgtsid == mets:file/@IDv2.8.22 validate mets:file/@ID syntaxv2.9.02 ocrd_all release managementhttps://github.com/OCR-D/ocrd_all/releases(Problem) reports from testers
Conference details
Meeting Room Name: OCR-D Open TechCall
Link: https://meet.gwdg.de/b/eli-ufa-unu
Telephone number: will be posted right before each meeting in the Gitter Chatroom