Add Mistral AI OCR script with test data and documentation

- ocr.php: two-step pipeline (mistral-ocr-latest + mistral-small-latest)
  extracts Serial Number, Model Number, and Date from part label photos
- input/: 5 test images of industrial part labels
- output/: corresponding YAML results
- README.md: full usage, setup, and troubleshooting docs
- .gitignore: excludes .env only
- .env.example: API key template

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
This commit is contained in:
Elmar Sönser 2026-03-04 18:29:07 +01:00
commit 5bf9e065e4
14 changed files with 682 additions and 0 deletions

View file

@ -0,0 +1,12 @@
---
serial_number: Z1X6029781024
model_number: B69199Q
date: null
source_file: WIN_20260304_15_05_25_Pro.jpg
processed_at: "2026-03-04 17:26:30"
raw_ocr: |
| POCLAIN
Hydraulics | W/N: 0126 |
| --- | --- |
| P/N: B69199Q | W/N: 0126 |
| S/N: Z1X6029781024 | FN |

View file

@ -0,0 +1,19 @@
---
serial_number: Z1X6029782007
model_number: B69199Q
date: null
source_file: WIN_20260304_15_05_41_Pro.jpg
processed_at: "2026-03-04 17:26:32"
raw_ocr: |
# POCLAIN
Hydraulics
P/N: B69199Q
S/N: Z1X6029782007
![img-0.jpeg](img-0.jpeg)
W/N: 0126
FN
![img-1.jpeg](img-1.jpeg)

View file

@ -0,0 +1,22 @@
---
serial_number: 2506053021331
model_number: 38E3470018G1
date: null
source_file: WIN_20260304_15_06_20_Pro.jpg
processed_at: "2026-03-04 17:26:34"
raw_ocr: |
# POWER CODE
IN-FIELD SUPPORT by VANGUARD™
basco.com/patents • data rates apply
Serial #:
2506053021331
Model Number:
38E3470018G1
![img-0.jpeg](img-0.jpeg)
![img-1.jpeg](img-1.jpeg)

View file

@ -0,0 +1,33 @@
---
serial_number: 25101001230300
model_number: 10418
date: 2025/04
source_file: WIN_20260304_15_09_52_Pro.jpg
processed_at: "2026-03-04 17:26:37"
raw_ocr: |
# TTControl
HYDAC INTERNATIONAL
EU contact: TTControl GmbH, Schönbrunner Str. 7, 1040 Vienna AT
UK contact: HYDAC Technology Ltd, De Havilland Way, Windrush Park, OX29 0YG Witney, UK
HY-TTC 60-CD-594K-768K-0000-000
Version: 01.00-D SW: 623 Date: 2025/04
Voltage: +12/24V S/N: 25101001230300
![img-0.jpeg](img-0.jpeg)
![img-1.jpeg](img-1.jpeg)
010005D
10R-04 0021
CODESYS®
Made in Hungary
P/N: [10418] 921088

View file

@ -0,0 +1,9 @@
---
serial_number: null
model_number: null
date: null
source_file: d885193d-5e69-4823-aa08-cacb618b3dd1.jpg
processed_at: "2026-03-04 17:26:38"
raw_ocr: |
| ![img-0.jpeg](img-0.jpeg) | | |
| --- | --- | --- |