扫描报告
5 /100
paddleocr-vl-locally
Complex document parsing with PaddleOCR — converts PDFs and document images into Markdown and JSON via a user-provided Triton inference endpoint.
PaddleOCR document parsing skill — clean implementation with no malicious behavior, no credential theft, no obfuscation, and honest documentation matching code.
可以安装
No action needed. The skill is safe to use as documented.
安全发现 1 项
| 严重性 | 安全发现 | 位置 |
|---|---|---|
| 低危 | Loose dependency version pins 供应链 | scripts/requirements.txt:4 |
| 资源类型 | 声明权限 | 推断权限 | 状态 | 证据 |
|---|---|---|---|---|
| 文件系统 | NONE | READ | ✓ 一致 | lib.py:141 — _load_file_as_base64 reads user-provided files |
| 网络访问 | NONE | WRITE | ✓ 一致 | lib.py:151 — httpx.Client.post to user-configured Triton endpoint |
| 命令执行 | NONE | NONE | — | No subprocess, no shell invocation anywhere in codebase |
| 环境变量 | READ | READ | ✓ 一致 | lib.py:47-57 — _get_env reads PADDLEOCR_DOC_PARSING_API_URL, PADDLEOCR_ACCESS_TO… |
7 项发现
中危 外部 URL 外部 URL
http://10.0.0.1:8020/v2/models/layout-parsing/infer SKILL.md:213 中危 外部 URL 外部 URL
http://10.0.133.33:8020/v2/models/layout-parsing/infer SKILL.md:223 中危 外部 URL 外部 URL
https://your-server.com/large_file.pdf SKILL.md:261 中危 外部 URL 外部 URL
http://www.apache.org/licenses/LICENSE-2.0 scripts/lib.py:7 中危 外部 URL 外部 URL
https://www.paddleocr.com scripts/smoke_test.py:42 中危 外部 URL 外部 URL
https://your-api-url.paddleocr.com/layout-parsing scripts/smoke_test.py:49 中危 外部 URL 外部 URL
https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/pp_structure_v3_demo.png scripts/smoke_test.py:115 目录结构
10 文件 · 44.2 KB · 1413 行 Python 5f · 974L
Markdown 2f · 422L
Text 2f · 12L
JSON 1f · 5L
├─
▾
references
│ └─
output_schema.md
Markdown
├─
▾
scripts
│ ├─
lib.py
Python
│ ├─
optimize_file.py
Python
│ ├─
requirements-optimize.txt
Text
│ ├─
requirements.txt
Text
│ ├─
smoke_test.py
Python
│ ├─
split_pdf.py
Python
│ └─
vl_caller.py
Python
├─
_meta.json
JSON
└─
SKILL.md
Markdown
依赖分析 3 项
| 包名 | 版本 | 来源 | 已知漏洞 | 备注 |
|---|---|---|---|---|
httpx | >=0.24.0 | pip | 否 | Version not pinned to exact release |
Pillow | >=10.0.0 | pip | 否 | Version not pinned to exact release |
pypdfium2 | >=4.0.0 | pip | 否 | Version not pinned to exact release |
安全亮点
✓ No shell execution, subprocess, or curl|bash patterns anywhere in the codebase
✓ No credential theft — environment variables are read-only and used only for API authentication
✓ No data exfiltration — all network calls go to the user-configured, user-provided Triton endpoint only
✓ No obfuscation — all code is plaintext Python, no base64/eval/atob patterns
✓ Documentation accurately reflects all implemented behavior (doc-to-code match)
✓ No access to sensitive paths (~/.ssh, ~/.aws, .env, etc.)
✓ No hidden instructions or steganographic content
✓ Filesystem writes are limited to user-specified output paths or the OS temp directory
✓ Error handling is thorough and returns structured errors without leaking system internals
✓ Apache 2.0 license from PaddlePaddle authors, consistent with PaddleOCR ecosystem