吾爱破解 - 52pojie.cn


[Python Original] Image Text Recognition Renaming Tool - Tencent OCR Edition

hfol85, posted 2025-4-3 22:05
Last edited by hfol85 on 2025-4-6 16:03

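Image Text Recognition Renaming Tool - Tencent OCR Edition

Features

This is an image text recognition and renaming tool built on the Tencent Cloud OCR service. Its main features are:

  1. Text recognition: uses the Tencent Cloud OCR API to recognize the text in an image
  2. Smart renaming: extracts content in the keyword order you set and renames the file accordingly
  3. Batch processing: handles multiple image files in a single run
  4. Saved configuration: automatically saves the API keys and keyword settings for next time
  5. Graphical interface: an intuitive GUI for previewing images and viewing recognition results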
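
The saved-configuration feature listed above can be sketched on its own as an INI round trip with `configparser`. This is a minimal sketch rather than the tool's exact code: the function names `save_settings` and `load_settings` are made up for illustration, and `interpolation=None` is an assumption added here so that a key containing `%` is stored literally.

```python
import configparser
import tempfile
from pathlib import Path


def save_settings(path, secret_id, secret_key, keywords):
    """Write the API credentials and the keyword list to an INI file."""
    # interpolation=None keeps '%' characters in values literal (assumption)
    config = configparser.ConfigParser(interpolation=None)
    config["TENCENT"] = {"SecretId": secret_id, "SecretKey": secret_key}
    config["SETTINGS"] = {"Keywords": keywords}
    with open(path, "w", encoding="utf-8") as f:
        config.write(f)


def load_settings(path):
    """Read the settings back, falling back to defaults for anything missing."""
    config = configparser.ConfigParser(interpolation=None)
    config.read(path, encoding="utf-8")  # a missing file is silently ignored
    return {
        "secret_id": config.get("TENCENT", "SecretId", fallback=""),
        "secret_key": config.get("TENCENT", "SecretKey", fallback=""),
        "keywords": config.get("SETTINGS", "Keywords", fallback="ID,Name,Date"),
    }


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        ini_path = Path(tmp) / "demo.ini"
        save_settings(ini_path, "my-id", "my-key", "ID,Date")
        print(load_settings(ini_path))
```

Note that the keys are written in plain text, so anyone who can read the file can read the SecretKey.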

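Usage

Basic steps

  1. Fill in the API settings

    • Enter your SecretId and SecretKey in the "Tencent Cloud OCR Settings" area
    • Both values are available from the Tencent Cloud console
  2. Set the keywords

    • In the "Keyword Settings" area, enter the keywords to look for, separated by commas
    • For example: "ID,Name,Date" (the order determines how the file name is assembled)
  3. Select images

    • Click the "Select Images" button and choose the image files to process
    • Multiple selection is supported, so a batch can be processed in one go
  4. Set the options

    • Check "Rename files automatically" to enable renaming
    • Check "Add sequence-number suffix" to append a sequence number to each file name and avoid duplicates
  5. Start processing

    • Click the "Recognize and Rename" button to start
    • Progress is shown in the progress bar
    • Recognition results and renaming details appear in the results area
  6. Review the results

    • Use the "Previous" / "Next" buttons to browse the images
    • The results area shows the recognized text and the renaming outcome for each image

Advanced features

  • Keyword order matters: the file name is assembled strictly in the keyword order you set
  • Separator handling: a leading separator after a keyword, such as a colon or a space, is stripped automatically
  • File name validation: illegal characters are removed so that the generated file name is valid
  • Duplicate prevention: a counter is appended automatically when a file name already exists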
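
The file name validation and duplicate prevention described above can be sketched as two small helpers. This is an illustrative sketch, not the tool's exact code: the names `make_valid_filename` and `unique_path` are hypothetical, and the set of removed characters is the Windows-reserved set plus line breaks and tabs.

```python
import os
import tempfile

# Characters that are not allowed in Windows file names, plus line breaks and tabs
INVALID_CHARS = '<>:"/\\|?*\n\r\t'


def make_valid_filename(text, max_length=100):
    """Drop illegal characters, turn spaces into underscores, cap the length."""
    cleaned = "".join(ch for ch in text if ch not in INVALID_CHARS)
    cleaned = cleaned.strip().replace(" ", "_")
    return cleaned[:max_length] or "unnamed"


def unique_path(directory, stem, ext):
    """Return a path in `directory` that does not exist yet, adding _1, _2, ... if needed."""
    candidate = os.path.join(directory, f"{stem}{ext}")
    counter = 1
    while os.path.exists(candidate):
        candidate = os.path.join(directory, f"{stem}_{counter}{ext}")
        counter += 1
    return candidate


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        stem = make_valid_filename("INV: 2023/001?")
        print(unique_path(tmp, stem, ".jpg"))
```

Checking for existence and renaming are two separate steps, so this is only safe when a single process is writing to the folder.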

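Use cases

  1. Office document management

    • Automatically rename scanned invoices, contracts and similar documents by ID and date
    • For example: recognizing "ID: 2023001 Date: 20230403" produces "2023001_20230403.jpg"
  2. Product image organization

    • Automatically name e-commerce product images by product ID and product name
    • For example: recognizing "Product ID: P1001 Product Name: Wireless Earbuds" produces "P1001_Wireless_Earbuds.jpg"
  3. ID photo archiving

    • Name photos of ID cards, student cards and similar documents by name, student number and so on
    • For example: recognizing "Name: Zhang San Student No.: 20230001" produces "Zhang_San_20230001.jpg"
  4. Meeting material organization

    • Automatically name meeting photos by meeting name and date
    • For example: recognizing "Meeting: Quarterly Review Date: 20230403" produces "Quarterly_Review_20230403.jpg"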
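
The examples above all follow the same pattern: find each keyword in the recognized text, take what follows it up to the next keyword, and join the pieces in keyword order. Below is a minimal sketch of that idea. The helper names `extract_fields` and `build_name` are hypothetical, and unlike the tool's own code this version strips any run of leading colons and spaces.

```python
def extract_fields(text, keywords):
    """Return the text that follows each keyword, in the order the keywords are given."""
    fields = []
    for keyword in keywords:
        start = text.find(keyword)
        if start == -1:
            fields.append("")  # keyword not found: keep the slot so order is preserved
            continue
        value_start = start + len(keyword)
        # The value ends where the next occurrence of any keyword begins
        next_positions = [
            pos
            for pos in (text.find(k, value_start) for k in keywords)
            if pos != -1
        ]
        value_end = min(next_positions) if next_positions else len(text)
        # Strip half-width and full-width colons and spaces after the keyword
        value = text[value_start:value_end].strip().lstrip(":\uff1a ").strip()
        # Keep only the first line of the value
        fields.append(value.split("\n")[0].strip() if value else "")
    return fields


def build_name(text, keywords, ext=".jpg"):
    """Join the non-empty fields with underscores; return None if nothing was found."""
    parts = [field for field in extract_fields(text, keywords) if field]
    return "_".join(parts) + ext if parts else None


if __name__ == "__main__":
    sample = "ID: 2023001 Date: 20230403"
    print(build_name(sample, ["ID", "Date"]))
    print(build_name(sample, ["Date", "ID"]))
```

Matching is a plain case-sensitive substring search, so a keyword that also occurs inside another word (for example "ID" inside "VALID") would match in the wrong place. That is why the keywords should be unique.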

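Notes

  1. API limits

    • A valid Tencent Cloud OCR account and API keys are required
    • Keep an eye on API call quotas and charges
  2. Image quality

    • The text in the image should be clear and legible
    • High-resolution images are recommended for better recognition
  3. Keyword settings

    • Keywords should be unique to avoid false matches
    • The keyword order determines the order in which the file name is assembled
    • Both Chinese and English keywords are recognized
  4. File name limits

    • Characters that are illegal in Windows file names are removed automatically
    • The extracted name is truncated to 100 characters (before any sequence suffix and the file extension are added)
  5. Performance

    • Processing a large number of images can take a long time
    • Network conditions affect how quickly the OCR API responds
  6. Data security

    • The API keys are saved in a local configuration file, so keep your computer secure
    • Assess the privacy risk before processing sensitive images
  7. Error handling

    • Files that fail recognition keep their original names
    • Error messages are shown in the results area
  8. Updates

    • The "Check for Updates" menu item is meant for finding out whether a new version exists (in the code posted below it only reports the current version)
    • New versions may fix bugs or add features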
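
Since the notes mention call quotas and a network-dependent API, one common way to make batch runs more forgiving is to retry a failed call with an increasing delay. The tool as posted does not do this; the sketch below is an assumption-laden illustration, and `call_with_retry` together with its parameters is a hypothetical helper, not part of the original code.

```python
import time


def call_with_retry(func, *args, retries=3, base_delay=1.0, sleep=time.sleep):
    """Call func(*args); on failure wait base_delay, 2*base_delay, ... and try again.

    `sleep` is injectable so the delays can be observed in tests without waiting.
    """
    last_error = None
    for attempt in range(retries):
        try:
            return func(*args)
        except Exception as exc:  # broad on purpose for this sketch
            last_error = exc
            if attempt < retries - 1:
                sleep(base_delay * (2 ** attempt))
    raise last_error


if __name__ == "__main__":
    attempts = {"count": 0}

    def flaky_ocr(path):
        # Stand-in for an OCR request that fails twice before succeeding
        attempts["count"] += 1
        if attempts["count"] < 3:
            raise ConnectionError("temporary network problem")
        return f"text from {path}"

    print(call_with_retry(flaky_ocr, "invoice.jpg", base_delay=0.1))
```

In real use it would be better to retry only errors that can succeed on a second attempt (timeouts, rate limiting), not errors such as wrong credentials.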


[Python]
import os
import tkinter as tk
from tkinter import filedialog, messagebox, ttk
from PIL import Image, ImageTk
import requests
import base64
import hashlib
import hmac
import time
import json
from threading import Thread
import logging
import configparser
from pathlib import Path

class ImageTextRenamerApp:
    def __init__(self, root):
        self.root = root
        self.root.title("Image Text Recognition Renaming Tool - Tencent OCR Edition")
        self.root.geometry("900x600")

        # Version information
        self.version = "1.2.1"
        self.release_date = "2025-04-03"

        # Initialize state
        self.image_files = []
        self.current_index = 0
        self.secret_id = ""
        self.secret_key = ""
        self.keywords = "ID,Name,Date"  # default keywords (order-sensitive)
        self.processing = False
        self.config_file = Path.home() / ".tencent_ocr_renamer.ini"

        # Configure logging
        logging.basicConfig(level=logging.INFO)
        self.logger = logging.getLogger(__name__)

        # Load the saved configuration
        self.load_config()

        # Create the menu bar
        self.create_menu()

        # Create the GUI widgets
        self.create_widgets()

        # Register the window-close handler
        self.root.protocol("WM_DELETE_WINDOW", self.on_closing)
     
    def create_menu(self):
        """Create the menu bar."""
        menubar = tk.Menu(self.root)

        # File menu
        file_menu = tk.Menu(menubar, tearoff=0)
        file_menu.add_command(label="Select Images", command=self.select_images)
        file_menu.add_separator()
        file_menu.add_command(label="Exit", command=self.on_closing)
        menubar.add_cascade(label="File", menu=file_menu)

        # Help menu
        help_menu = tk.Menu(menubar, tearoff=0)
        help_menu.add_command(label="Help", command=self.show_help)
        help_menu.add_command(label="Check for Updates", command=self.check_update)
        help_menu.add_separator()
        help_menu.add_command(label="About", command=self.show_about)
        menubar.add_cascade(label="Help", menu=help_menu)

        self.root.config(menu=menubar)
     
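    def show_help(self):
        """Show the usage guide."""
        help_text = f"""Image Text Recognition Renaming Tool: User Guide (version {self.version})

[Key feature] File names are generated strictly in the keyword order you set

1. Basic functions
1. Recognize the text in images with Tencent Cloud OCR
2. Rename files based on the content that follows each keyword
3. Assemble the file name in the keyword order you set

2. Steps
1. Enter your Tencent Cloud OCR SecretId and SecretKey
2. Set the keyword order (e.g. ID,Name,Date)
3. Click "Select Images" to add files
4. Check the options:
   - Rename files automatically (required)
   - Add sequence-number suffix (prevents duplicates)
5. Click "Recognize and Rename"

3. Keyword tips
1. The order determines the file name structure (e.g. "Date,Name" yields "20231116_Product.jpg")
2. Separate multiple keywords with commas
3. Keywords should be unique (to avoid false matches)
4. Chinese and English both work (e.g. ID,name,日期)

4. FAQ
1. Poor recognition? Try adjusting the keywords or the image quality
2. Wrong order? Check the order of the keyword setting
3. Need technical support? Contact the developer
"""
        self.show_info_dialog("Help", help_text)

    def check_update(self):
        """Check for updates (this version only reports the current version)."""
        messagebox.showinfo("Check for Updates", f"You are running the latest version ({self.version})")

    def show_about(self):
        """Show the About dialog."""
        about_text = f"""Image Text Recognition Renaming Tool v{self.version}

[Core feature] Builds file names from keywords in the order you set

Developer: Hfol85
Contact: hfol85 @ 52pojie forum (吾爱破解论坛)
Release date: {self.release_date}

Tech stack:
- Tencent Cloud OCR API
- Python 3.x
- tkinter GUI
- Multithreaded processing
"""
        self.show_info_dialog("About", about_text)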
 
    def show_info_dialog(self, title, message):
        """Show a read-only information dialog."""
        dialog = tk.Toplevel(self.root)
        dialog.title(title)
        dialog.resizable(True, True)

        text = tk.Text(dialog, wrap=tk.WORD, padx=10, pady=10)
        text.insert(tk.END, message)
        text.config(state=tk.DISABLED)
        text.pack(fill=tk.BOTH, expand=True)

        btn_frame = tk.Frame(dialog)
        btn_frame.pack(fill=tk.X, pady=5)

        close_btn = tk.Button(btn_frame, text="Close", command=dialog.destroy)
        close_btn.pack()

        dialog.geometry(f"600x400+{self.root.winfo_x()+100}+{self.root.winfo_y()+100}")

    def load_config(self):
        """Load the settings from the configuration file."""
        # interpolation=None keeps '%' characters in keys or keywords literal
        config = configparser.ConfigParser(interpolation=None)
        if self.config_file.exists():
            try:
                # Read as UTF-8 so non-ASCII keywords survive on every platform
                config.read(self.config_file, encoding='utf-8')
                self.secret_id = config.get('TENCENT', 'SecretId', fallback='')
                self.secret_key = config.get('TENCENT', 'SecretKey', fallback='')
                self.keywords = config.get('SETTINGS', 'Keywords', fallback='ID,Name,Date')
            except Exception as e:
                self.logger.error(f"Failed to load the configuration file: {str(e)}")

    def save_config(self):
        """Save the settings to the configuration file."""
        config = configparser.ConfigParser(interpolation=None)
        config['TENCENT'] = {
            'SecretId': self.secret_id,
            'SecretKey': self.secret_key
        }
        config['SETTINGS'] = {
            'Keywords': self.keywords
        }
        try:
            with open(self.config_file, 'w', encoding='utf-8') as f:
                config.write(f)
        except Exception as e:
            self.logger.error(f"Failed to save the configuration file: {str(e)}")

    def on_closing(self):
        """Handle the window-close event."""
        self.save_config()
        self.root.destroy()
 
    def create_widgets(self):
        """Create the main window widgets."""
        # Left: image preview area
        self.left_panel = tk.Frame(self.root, width=600, height=550, bg='white')
        self.left_panel.pack(side=tk.LEFT, fill=tk.BOTH, expand=True, padx=5, pady=5)
        self.left_panel.pack_propagate(False)

        self.image_label = tk.Label(self.left_panel, bg='white')
        self.image_label.pack(fill=tk.BOTH, expand=True)

        # Right: control area
        self.right_panel = tk.Frame(self.root, width=300, height=550)
        self.right_panel.pack(side=tk.RIGHT, fill=tk.Y, padx=5, pady=5)
        self.right_panel.pack_propagate(False)

        # API settings area
        self.api_frame = tk.LabelFrame(self.right_panel, text="Tencent Cloud OCR Settings")
        self.api_frame.pack(fill=tk.X, pady=5)

        tk.Label(self.api_frame, text="SecretId:").pack(anchor=tk.W)
        self.secret_id_entry = tk.Entry(self.api_frame)
        self.secret_id_entry.insert(0, self.secret_id)
        self.secret_id_entry.pack(fill=tk.X, padx=5)

        tk.Label(self.api_frame, text="SecretKey:").pack(anchor=tk.W)
        self.secret_key_entry = tk.Entry(self.api_frame, show="*")
        self.secret_key_entry.insert(0, self.secret_key)
        self.secret_key_entry.pack(fill=tk.X, padx=5)

        # File selection button
        self.select_btn = tk.Button(self.right_panel, text="Select Images", command=self.select_images)
        self.select_btn.pack(fill=tk.X, pady=5, padx=5)

        # Recognition options
        self.options_frame = tk.LabelFrame(self.right_panel, text="Recognition Options")
        self.options_frame.pack(fill=tk.X, pady=5, padx=5)

        self.rename_var = tk.IntVar(value=1)
        tk.Checkbutton(self.options_frame, text="Rename files automatically", variable=self.rename_var).pack(anchor=tk.W)

        self.suffix_var = tk.IntVar(value=0)
        tk.Checkbutton(self.options_frame, text="Add sequence-number suffix", variable=self.suffix_var).pack(anchor=tk.W)

        # Keyword settings
        self.keywords_frame = tk.LabelFrame(self.right_panel, text="Keyword Settings (combined in order)")
        self.keywords_frame.pack(fill=tk.X, pady=5, padx=5)

        tk.Label(self.keywords_frame, text="Comma-separated, e.g. ID,Name,Date").pack(anchor=tk.W)
        self.keywords_entry = tk.Entry(self.keywords_frame)
        self.keywords_entry.insert(0, self.keywords)
        self.keywords_entry.pack(fill=tk.X, padx=5)

        # Progress bar
        self.progress = ttk.Progressbar(self.right_panel, orient=tk.HORIZONTAL, mode='determinate')
        self.progress.pack(fill=tk.X, pady=5, padx=5)

        # Action button
        self.process_btn = tk.Button(self.right_panel, text="Recognize and Rename", command=self.start_processing)
        self.process_btn.pack(fill=tk.X, pady=5, padx=5)

        # Results display
        self.result_frame = tk.LabelFrame(self.right_panel, text="Recognition Results")
        self.result_frame.pack(fill=tk.BOTH, expand=True, pady=5, padx=5)

        self.result_text = tk.Text(self.result_frame, height=10, wrap=tk.WORD)
        scrollbar = tk.Scrollbar(self.result_frame)
        scrollbar.pack(side=tk.RIGHT, fill=tk.Y)
        self.result_text.pack(fill=tk.BOTH, expand=True)
        self.result_text.config(yscrollcommand=scrollbar.set)
        scrollbar.config(command=self.result_text.yview)

        # Navigation buttons
        self.nav_frame = tk.Frame(self.right_panel)
        self.nav_frame.pack(fill=tk.X, pady=5, padx=5)

        self.prev_btn = tk.Button(self.nav_frame, text="Previous", command=self.prev_image)
        self.prev_btn.pack(side=tk.LEFT, expand=True, padx=2)

        self.next_btn = tk.Button(self.nav_frame, text="Next", command=self.next_image)
        self.next_btn.pack(side=tk.RIGHT, expand=True, padx=2)

        # Status bar
        self.status_var = tk.StringVar()
        self.status_var.set("Ready")
        self.status_bar = tk.Label(self.root, textvariable=self.status_var, bd=1, relief=tk.SUNKEN, anchor=tk.W)
        self.status_bar.pack(side=tk.BOTTOM, fill=tk.X)
 
    def select_images(self):
        """Choose the image files to process."""
        if self.processing:
            messagebox.showwarning("Warning", "Processing is in progress. Please select images later.")
            return

        files = filedialog.askopenfilenames(
            title="Select image files",
            filetypes=[("Image files", "*.jpg *.jpeg *.png *.bmp *.gif"), ("All files", "*.*")]
        )

        if files:
            self.image_files = list(files)
            self.current_index = 0
            self.show_current_image()
            self.status_var.set(f"{len(self.image_files)} image(s) selected")
            self.result_text.delete("1.0", tk.END)

    def show_current_image(self):
        """Display the current image."""
        if not self.image_files:
            return

        try:
            image_path = self.image_files[self.current_index]
            # Open inside a context manager so the file handle is released
            # before the file is renamed (an open handle blocks renaming on Windows)
            with Image.open(image_path) as img:
                # Scale the image down to fit the preview area
                max_size = (550, 500)
                img.thumbnail(max_size, Image.LANCZOS)
                photo = ImageTk.PhotoImage(img)

            self.image_label.config(image=photo)
            self.image_label.image = photo  # keep a reference so it is not garbage-collected

            # Update the status bar
            self.status_var.set(f"Image {self.current_index + 1}/{len(self.image_files)}: {os.path.basename(image_path)}")
        except Exception as e:
            messagebox.showerror("Error", f"Unable to load image: {str(e)}")

    def prev_image(self):
        """Show the previous image."""
        if self.processing:
            return
        if self.image_files and self.current_index > 0:
            self.current_index -= 1
            self.show_current_image()

    def next_image(self):
        """Show the next image."""
        if self.processing:
            return
        if self.image_files and self.current_index < len(self.image_files) - 1:
            self.current_index += 1
            self.show_current_image()
 
    def start_processing(self):
        """Validate the inputs and start processing in a worker thread."""
        if self.processing:
            return

        if not self.image_files:
            messagebox.showwarning("Warning", "Please select image files first")
            return

        self.secret_id = self.secret_id_entry.get().strip()
        self.secret_key = self.secret_key_entry.get().strip()
        self.keywords = self.keywords_entry.get().strip()

        if not self.secret_id or not self.secret_key:
            messagebox.showwarning("Warning", "Please enter your Tencent Cloud SecretId and SecretKey")
            return

        if not self.keywords:
            messagebox.showwarning("Warning", "Please enter at least one keyword")
            return

        # Read the Tk variables here, on the main thread; Tk objects should
        # not be accessed directly from the worker thread
        auto_rename = bool(self.rename_var.get())
        add_suffix = bool(self.suffix_var.get())

        # Disable the buttons to prevent repeated clicks
        self.processing = True
        self.select_btn.config(state=tk.DISABLED)
        self.process_btn.config(state=tk.DISABLED)
        self.prev_btn.config(state=tk.DISABLED)
        self.next_btn.config(state=tk.DISABLED)

        # Reset the progress bar
        self.progress["value"] = 0
        self.progress["maximum"] = len(self.image_files)

        # Clear previous results
        self.result_text.delete("1.0", tk.END)

        # Process in a new thread so the GUI stays responsive
        Thread(target=self.process_images, args=(auto_rename, add_suffix), daemon=True).start()

    def extract_keyword_contents(self, text):
        """Extract the content after each keyword, in the configured order (core feature)."""
        keywords = [kw.strip() for kw in self.keywords.split(',') if kw.strip()]
        extracted_contents = []

        for keyword in keywords:  # strictly in the configured order
            index = text.find(keyword)
            if index != -1:
                content_start = index + len(keyword)
                # The next occurrence of any keyword marks where this value ends
                next_pos = None
                for kw in keywords:
                    pos = text.find(kw, content_start)
                    if pos != -1 and (next_pos is None or pos < next_pos):
                        next_pos = pos

                content = text[content_start:next_pos].strip() if next_pos is not None else text[content_start:].strip()
                # Strip one leading separator: half-width colon, full-width colon or space
                for sep in [":", "\uff1a", " "]:
                    if content.startswith(sep):
                        content = content[len(sep):].strip()
                        break
                # Keep only the first line
                extracted_contents.append(content.split('\n')[0].strip())
            else:
                extracted_contents.append("")  # not found: leave empty but keep the position

        return extracted_contents

    def process_images(self, auto_rename, add_suffix):
        """Process every selected image (runs in the worker thread)."""
        try:
            total = len(self.image_files)
            for i, image_path in enumerate(self.image_files):
                # Update the progress display
                self.root.after(0, lambda v=i+1: self.progress.configure(value=v))
                self.root.after(0, self.status_var.set,
                                f"Processing {i+1}/{total}: {os.path.basename(image_path)}")

                # Recognize the text in the image
                try:
                    recognized_text = self.recognize_text_with_tencent_ocr(image_path)
                    # Placeholder for display only, so it is never searched for keywords
                    display_text = recognized_text or "No text recognized"

                    # Show the full recognition result
                    self.root.after(0, self.result_text.insert, tk.END,
                                    f"{os.path.basename(image_path)}:\n{display_text}\n\n")

                    # Rename if requested
                    if auto_rename:
                        # Extract the content that follows each keyword
                        keyword_contents = self.extract_keyword_contents(recognized_text)

                        # Drop empty entries and join the rest
                        filtered_contents = [c for c in keyword_contents if c]
                        new_name = "_".join(filtered_contents) if filtered_contents else "keyword_content_not_found"

                        new_name = self.generate_valid_filename(new_name)
                        if add_suffix:
                            new_name = f"{new_name}_{i+1:03d}"

                        new_path = self.rename_file(image_path, new_name)
                        self.image_files[i] = new_path  # keep the file list in sync
                        self.root.after(0, self.result_text.insert, tk.END,
                                        f"Renamed to: {os.path.basename(new_path)}\n\n")

                except Exception as e:
                    self.logger.error(f"Error while processing image: {str(e)}")
                    self.root.after(0, self.result_text.insert, tk.END,
                                    f"Error while processing {os.path.basename(image_path)}: {str(e)}\n\n")

                # Refresh the preview if this is the image being displayed
                if i == self.current_index:
                    self.root.after(0, self.show_current_image)

            self.root.after(0, self.status_var.set, "Processing complete")

        except Exception as e:
            self.logger.error(f"Error during processing: {str(e)}")
            self.root.after(0, messagebox.showerror, "Error", f"Error during processing: {str(e)}")

        finally:
            # Re-enable the buttons
            self.processing = False
            self.root.after(0, self.select_btn.config, {'state': tk.NORMAL})
            self.root.after(0, self.process_btn.config, {'state': tk.NORMAL})
            self.root.after(0, self.prev_btn.config, {'state': tk.NORMAL})
            self.root.after(0, self.next_btn.config, {'state': tk.NORMAL})
 
    def recognize_text_with_tencent_ocr(self, image_path):
        """Call the Tencent Cloud OCR API to recognize the text in an image."""
        try:
            # Read the image and encode it as base64
            with open(image_path, "rb") as image_file:
                image_data = image_file.read()
                image_base64 = base64.b64encode(image_data).decode('utf-8')

            # Tencent Cloud OCR API parameters
            action = "GeneralBasicOCR"
            region = "ap-guangzhou"
            endpoint = "ocr.tencentcloudapi.com"
            service = "ocr"
            version = "2018-11-19"
            algorithm = "TC3-HMAC-SHA256"

            # Current timestamp and the matching UTC date
            timestamp = int(time.time())
            date = time.strftime("%Y-%m-%d", time.gmtime(timestamp))

            # ************* Step 1: build the canonical request *************
            http_request_method = "POST"
            canonical_uri = "/"
            canonical_querystring = ""
            canonical_headers = "content-type:application/json; charset=utf-8\n" + f"host:{endpoint}\n"
            signed_headers = "content-type;host"

            payload = {
                "ImageBase64": image_base64,
                "LanguageType": "auto"
            }
            payload_str = json.dumps(payload)

            hashed_request_payload = hashlib.sha256(payload_str.encode('utf-8')).hexdigest()

            canonical_request = (http_request_method + "\n" +
                               canonical_uri + "\n" +
                               canonical_querystring + "\n" +
                               canonical_headers + "\n" +
                               signed_headers + "\n" +
                               hashed_request_payload)

            # ************* Step 2: build the string to sign *************
            credential_scope = date + "/" + service + "/" + "tc3_request"
            hashed_canonical_request = hashlib.sha256(canonical_request.encode('utf-8')).hexdigest()

            string_to_sign = (algorithm + "\n" +
                            str(timestamp) + "\n" +
                            credential_scope + "\n" +
                            hashed_canonical_request)

            # ************* Step 3: compute the signature *************
            secret_date = hmac.new(("TC3" + self.secret_key).encode('utf-8'),
                                 date.encode('utf-8'), hashlib.sha256).digest()
            secret_service = hmac.new(secret_date, service.encode('utf-8'), hashlib.sha256).digest()
            secret_signing = hmac.new(secret_service, "tc3_request".encode('utf-8'), hashlib.sha256).digest()
            signature = hmac.new(secret_signing, string_to_sign.encode('utf-8'), hashlib.sha256).hexdigest()

            # ************* Step 4: build the Authorization header *************
            authorization = (algorithm + " " +
                            "Credential=" + self.secret_id + "/" + credential_scope + ", " +
                            "SignedHeaders=" + signed_headers + ", " +
                            "Signature=" + signature)

            # ************* Send the request *************
            headers = {
                "Authorization": authorization,
                "Content-Type": "application/json; charset=utf-8",
                "Host": endpoint,
                "X-TC-Action": action,
                "X-TC-Version": version,
                "X-TC-Timestamp": str(timestamp),
                "X-TC-Region": region
            }

            response = requests.post(f"https://{endpoint}", headers=headers, data=payload_str, timeout=30)

            # Check the HTTP status
            if response.status_code != 200:
                error_data = response.json()
                error_msg = error_data.get("Response", {}).get("Error", {}).get("Message", f"HTTP {response.status_code} error")
                raise Exception(f"Tencent OCR API request error: {error_msg}")

            result = response.json()
            response_body = result.get("Response", {})

            # The API can also report a failure in the body of an HTTP 200 response
            api_error = response_body.get("Error")
            if api_error:
                raise Exception(f"Tencent OCR API error: {api_error.get('Message', 'unknown error')}")

            # Collect the recognized lines
            text_detections = response_body.get("TextDetections", [])
            recognized_text = ""

            for detection in text_detections:
                recognized_text += detection.get("DetectedText", "") + "\n"

            return recognized_text.strip()

        except requests.exceptions.RequestException as e:
            error_msg = str(e)
            if hasattr(e, "response") and e.response is not None:
                try:
                    error_data = e.response.json()
                    error_msg = error_data.get("Response", {}).get("Error", {}).get("Message", error_msg)
                except ValueError:
                    # The error response body was not JSON; keep the original message
                    pass
            raise Exception(f"API request failed: {error_msg}") from e
        except Exception as e:
            raise Exception(f"Error during recognition: {str(e)}") from e
 
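    def generate_valid_filename(self, text, max_length=100):
        """Build a valid file name from the recognized text."""
        if not text:
            return None

        # Remove characters that are illegal in file names
        invalid_chars = '<>:"/\\|?*\n\r\t'
        for char in invalid_chars:
            text = text.replace(char, '')

        # Replace spaces with underscores
        text = text.replace(' ', '_')

        # Truncate to the maximum length
        text = text.strip()
        if len(text) > max_length:
            text = text[:max_length]

        return text if text else "unnamed"

    def rename_file(self, old_path, new_name):
        """Rename a file, appending a counter if the target name is taken."""
        dir_name = os.path.dirname(old_path)
        ext = os.path.splitext(old_path)[1]
        old_key = os.path.normcase(os.path.abspath(old_path))

        # Make sure the file name is unique
        counter = 1
        base_new_name = new_name
        while True:
            new_path = os.path.join(dir_name, f"{base_new_name}{ext}")
            same_file = os.path.normcase(os.path.abspath(new_path)) == old_key
            if same_file or not os.path.exists(new_path):
                break
            base_new_name = f"{new_name}_{counter}"
            counter += 1

        # Nothing to do when the file already has the target name
        if not same_file:
            os.rename(old_path, new_path)
        return new_path

if __name__ == "__main__":
    root = tk.Tk()
    app = ImageTextRenamerApp(root)
    root.mainloop()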
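
For anyone who wants to test the signing logic without the GUI, the four signing steps in `recognize_text_with_tencent_ocr` can be pulled out into a pure function. The sketch below follows the same steps as the listing above. The function name `build_tc3_authorization` and its parameters are made up for illustration, and the credentials in the demo are dummy values.

```python
import hashlib
import hmac
import time


def _hmac_sha256(key: bytes, message: str) -> bytes:
    """Raw HMAC-SHA256 digest of message under key."""
    return hmac.new(key, message.encode("utf-8"), hashlib.sha256).digest()


def build_tc3_authorization(secret_id, secret_key, payload, timestamp,
                            host="ocr.tencentcloudapi.com", service="ocr"):
    """Build the Authorization header value for a JSON POST request."""
    algorithm = "TC3-HMAC-SHA256"
    date = time.strftime("%Y-%m-%d", time.gmtime(timestamp))  # UTC date

    # Step 1: canonical request (method, URI, query string, headers, payload hash)
    canonical_headers = f"content-type:application/json; charset=utf-8\nhost:{host}\n"
    signed_headers = "content-type;host"
    hashed_payload = hashlib.sha256(payload.encode("utf-8")).hexdigest()
    canonical_request = "\n".join(
        ["POST", "/", "", canonical_headers, signed_headers, hashed_payload]
    )

    # Step 2: string to sign
    credential_scope = f"{date}/{service}/tc3_request"
    hashed_request = hashlib.sha256(canonical_request.encode("utf-8")).hexdigest()
    string_to_sign = "\n".join(
        [algorithm, str(timestamp), credential_scope, hashed_request]
    )

    # Step 3: derive the signing key, then sign
    secret_date = _hmac_sha256(("TC3" + secret_key).encode("utf-8"), date)
    secret_service = _hmac_sha256(secret_date, service)
    secret_signing = _hmac_sha256(secret_service, "tc3_request")
    signature = hmac.new(
        secret_signing, string_to_sign.encode("utf-8"), hashlib.sha256
    ).hexdigest()

    # Step 4: assemble the header value
    return (f"{algorithm} Credential={secret_id}/{credential_scope}, "
            f"SignedHeaders={signed_headers}, Signature={signature}")


if __name__ == "__main__":
    # Dummy credentials and payload, for illustration only
    print(build_tc3_authorization("AKID-EXAMPLE", "example-key", '{"a": 1}', int(time.time())))
```

Passing the timestamp in as a parameter makes the output deterministic, so the same inputs always give the same signature. The `X-TC-Timestamp` header sent with the request must carry that same timestamp.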


Added a locally deployed OCR implementation (PaddleOCR approach). It is not packaged.
[Python]
import os
# Allow duplicate OpenMP runtimes to load (works around the OpenMP conflict)
os.environ['KMP_DUPLICATE_LIB_OK'] = 'TRUE'
 
import tkinter as tk
from tkinter import filedialog, messagebox, ttk
from PIL import Image, ImageTk
import requests
import base64
import hashlib
import hmac
import time
import json
from threading import Thread
import logging
import configparser
from pathlib import Path
import subprocess
from enum import Enum, auto
 
# Try to import PaddleOCR (optional dependency)
try:
    from paddleocr import PaddleOCR
    PADDLEOCR_AVAILABLE = True
except ImportError:
    PADDLEOCR_AVAILABLE = False
 
# Try to import tkinterdnd2 (optional; enables drag and drop)
try:
    from tkinterdnd2 import DND_FILES, TkinterDnD
    TKDND_AVAILABLE = True
except ImportError:
    TKDND_AVAILABLE = False
 
class OcrMode(Enum):
    """OCR mode enumeration."""
    TENCENT = auto()
    PADDLE = auto()
 
class ImageTextRenamerApp:
    def __init__(self, root):
        self.root = root
        self.root.title("图片文字识别重命名工具")
        self.root.geometry("1000x850")
         
        # Version information
        self.version = "1.3.0"
        self.release_date = "2025-04-05"
         
        # Initialize state
        self.image_files = []
        self.current_index = 0
        self.secret_id = ""
        self.secret_key = ""
        self.keywords = "编号,名称,日期"
        self.processing = False
        self.config_file = Path.home() / ".ocr_renamer.ini"
         
        # OCR mode settings
        self.ocr_mode = OcrMode.TENCENT
        self.paddle_ocr = None
         
        # State for the rename tab
        self.replace_files_list = []
        self.replace_current_index = 0
         
        # Configure logging
        logging.basicConfig(level=logging.INFO)
        self.logger = logging.getLogger(__name__)

        # Load the saved configuration
        self.load_config()

        # Initialize PaddleOCR if that mode is selected and the package is installed
        if self.ocr_mode == OcrMode.PADDLE and PADDLEOCR_AVAILABLE:
            self.initialize_paddle_ocr()

        # Enable drag and drop if tkinterdnd2 is available
        if TKDND_AVAILABLE:
            self.root.drop_target_register(DND_FILES)
            self.root.dnd_bind('<<Drop>>', self.handle_drop)

            # Hint label for drag and drop
            self.drop_label = ttk.Label(self.root, text="拖放图片到此处")
            self.drop_label.place(relx=0.5, rely=0.5, anchor='center')
        else:
            self.logger.warning("tkinterdnd2未安装,拖放功能不可用")

        # Build the UI
        self.create_widgets()
        self.root.protocol("WM_DELETE_WINDOW", self.on_closing)
     
    def create_menu(self):
        """Create the menu bar."""
        menubar = tk.Menu(self.root)
        self.root.config(menu=menubar)
         
        # File menu
        file_menu = tk.Menu(menubar, tearoff=0)
        file_menu.add_command(label="选择图片", command=self.select_images)
        file_menu.add_separator()
        file_menu.add_command(label="退出", command=self.on_closing)
        menubar.add_cascade(label="文件", menu=file_menu)
         
        # Settings menu
        settings_menu = tk.Menu(menubar, tearoff=0)
        settings_menu.add_command(label="OCR配置", command=self.show_ocr_config)
        settings_menu.add_command(label="关键词设置", command=self.show_keywords_config)
        menubar.add_cascade(label="设置", menu=settings_menu)
         
        # Help menu
        help_menu = tk.Menu(menubar, tearoff=0)
        help_menu.add_command(label="使用帮助", command=self.show_help)
        help_menu.add_command(label="检查更新", command=self.check_update)
        help_menu.add_separator()
        help_menu.add_command(label="关于", command=self.show_about)
        menubar.add_cascade(label="帮助", menu=help_menu)
     
    def show_ocr_config(self):
        """Show the OCR configuration dialog."""
        dialog = tk.Toplevel(self.root)
        dialog.title("OCR配置")
        dialog.resizable(False, False)

        # Position the dialog relative to the main window
        dialog.geometry(f"500x400+{self.root.winfo_x()+100}+{self.root.winfo_y()+100}")

        # Main frame
        main_frame = tk.Frame(dialog, padx=20, pady=20)
        main_frame.pack(fill=tk.BOTH, expand=True, padx=10, pady=10)

        # OCR mode selection
        mode_frame = tk.LabelFrame(main_frame, text="OCR模式选择", padx=10, pady=10)
        mode_frame.pack(fill=tk.X, pady=(0, 15))
         
        ocr_mode_var = tk.IntVar(value=self.ocr_mode.value)
         
        tk.Radiobutton(
            mode_frame,
            text="腾讯云OCR (在线)",
            variable=ocr_mode_var,
            value=OcrMode.TENCENT.value
        ).pack(anchor=tk.W, pady=2)
         
        paddle_state = tk.NORMAL if PADDLEOCR_AVAILABLE else tk.DISABLED
        paddle_text = "PaddleOCR (本地)" if PADDLEOCR_AVAILABLE else "PaddleOCR (本地, 不可用)"
        paddle_radio = tk.Radiobutton(
            mode_frame,
            text=paddle_text,
            variable=ocr_mode_var,
            value=OcrMode.PADDLE.value,
            state=paddle_state
        )
        paddle_radio.pack(anchor=tk.W, pady=2)
         
        # API credentials (only needed for Tencent Cloud OCR)
        api_frame = tk.LabelFrame(main_frame, text="腾讯云OCR配置", padx=10, pady=10)
        api_frame.pack(fill=tk.X, pady=(0, 15))
         
        tk.Label(api_frame, text="SecretId:").pack(anchor=tk.W, pady=(0, 2))
        secret_id_entry = tk.Entry(api_frame)
        secret_id_entry.insert(0, self.secret_id)
        secret_id_entry.pack(fill=tk.X, pady=(0, 5))
         
        tk.Label(api_frame, text="SecretKey:").pack(anchor=tk.W, pady=(0, 2))
        secret_key_entry = tk.Entry(api_frame, show="*")
        secret_key_entry.insert(0, self.secret_key)
        secret_key_entry.pack(fill=tk.X, pady=(0, 5))
         
        # Buttons
        btn_frame = tk.Frame(main_frame, pady=15)
        btn_frame.pack(fill=tk.X)
         
        save_btn = tk.Button(btn_frame, text="保存",
                            command=lambda: self.save_ocr_config(
                                ocr_mode_var.get(),
                                secret_id_entry.get(),
                                secret_key_entry.get(),
                                dialog
                            ))
        save_btn.pack(side=tk.RIGHT, padx=5)
         
        cancel_btn = tk.Button(btn_frame, text="取消", command=dialog.destroy)
        cancel_btn.pack(side=tk.RIGHT, padx=5)
         
        # Make the dialog modal
        dialog.transient(self.root)
        dialog.grab_set()
     
    def show_keywords_config(self):
        """Show the keyword configuration dialog."""
        dialog = tk.Toplevel(self.root)
        dialog.title("关键词设置")
        dialog.resizable(False, False)

        # Position the dialog relative to the main window
        dialog.geometry(f"500x300+{self.root.winfo_x()+100}+{self.root.winfo_y()+100}")

        # Main frame
        main_frame = tk.Frame(dialog, padx=20, pady=20)
        main_frame.pack(fill=tk.BOTH, expand=True, padx=10, pady=10)

        # Keyword settings
        keywords_frame = tk.LabelFrame(main_frame, text="关键词设置(按顺序组合)", padx=10, pady=10)
        keywords_frame.pack(fill=tk.X, pady=(0, 15))
         
        tk.Label(keywords_frame, text="用逗号分隔,如: 编号,名称,日期").pack(anchor=tk.W, pady=(0, 5))
        keywords_entry = tk.Entry(keywords_frame)
        keywords_entry.insert(0, self.keywords)
        keywords_entry.pack(fill=tk.X, pady=(0, 5))
         
        # Buttons
        btn_frame = tk.Frame(main_frame, pady=15)
        btn_frame.pack(fill=tk.X)
         
        save_btn = tk.Button(btn_frame, text="保存",
                            command=lambda: self.save_keywords_config(
                                keywords_entry.get(),
                                dialog
                            ))
        save_btn.pack(side=tk.RIGHT, padx=5)
         
        cancel_btn = tk.Button(btn_frame, text="取消", command=dialog.destroy)
        cancel_btn.pack(side=tk.RIGHT, padx=5)
         
        # Make the dialog modal
        dialog.transient(self.root)
        dialog.grab_set()
     
    def create_widgets(self):
        """Create the main window widgets."""
        # Tab container
        self.notebook = ttk.Notebook(self.root)
        self.notebook.pack(fill=tk.BOTH, expand=True, padx=10, pady=10)

        # OCR tab
        self.ocr_frame = tk.Frame(self.notebook)
        self.notebook.add(self.ocr_frame, text="OCR识别")

        # Rename tab
        self.replace_frame = tk.Frame(self.notebook)
        self.notebook.add(self.replace_frame, text="重命名")

        # Build the OCR tab
        self.create_ocr_widgets()

        # Build the rename tab
        self.create_replace_widgets()
     
    def create_ocr_widgets(self):
        """Create the widgets for the OCR tab."""
        # Image preview panel on the left
        self.left_panel = tk.Frame(self.ocr_frame, width=600, height=550)
        self.left_panel.pack(side=tk.LEFT, fill=tk.BOTH, expand=True, padx=10, pady=10)
        self.left_panel.pack_propagate(False)
         
        # Image title
        self.image_title = tk.Label(self.left_panel, text="图片预览")
        self.image_title.pack(fill=tk.X, pady=(0, 5))
         
        # Image display area
        self.image_container = tk.Frame(self.left_panel, highlightbackground="#cccccc", highlightthickness=1)
        self.image_container.pack(fill=tk.BOTH, expand=True)
         
        self.image_label = tk.Label(self.image_container)
        self.image_label.pack(fill=tk.BOTH, expand=True, padx=5, pady=5)
         
        # Control panel on the right
        self.right_panel = tk.Frame(self.ocr_frame, width=350, height=550)
        self.right_panel.pack(side=tk.RIGHT, fill=tk.Y, padx=10, pady=10)
        self.right_panel.pack_propagate(False)
         
        # OCR mode selection
        self.ocr_mode_frame = tk.LabelFrame(self.right_panel, text="OCR模式选择", padx=10, pady=10)
        self.ocr_mode_frame.pack(fill=tk.X, pady=5)
         
        self.ocr_mode_var = tk.IntVar(value=self.ocr_mode.value)
         
        tk.Radiobutton(
            self.ocr_mode_frame,
            text="腾讯云OCR (在线)",
            variable=self.ocr_mode_var,
            value=OcrMode.TENCENT.value,
            command=self.on_ocr_mode_change
        ).pack(anchor=tk.W, pady=2)
         
        paddle_state = tk.NORMAL if PADDLEOCR_AVAILABLE else tk.DISABLED
        paddle_text = "PaddleOCR (本地)" if PADDLEOCR_AVAILABLE else "PaddleOCR (本地, 不可用)"
        self.paddle_radio = tk.Radiobutton(
            self.ocr_mode_frame,
            text=paddle_text,
            variable=self.ocr_mode_var,
            value=OcrMode.PADDLE.value,
            command=self.on_ocr_mode_change,
            state=paddle_state
        )
        self.paddle_radio.pack(anchor=tk.W, pady=2)
         
        # API credentials
        self.api_frame = tk.LabelFrame(self.right_panel, text="腾讯云OCR配置", padx=10, pady=10)
        self.api_frame.pack(fill=tk.X, pady=5)
         
        tk.Label(self.api_frame, text="SecretId:").pack(anchor=tk.W, pady=(0, 2))
        self.secret_id_entry = tk.Entry(self.api_frame)
        self.secret_id_entry.insert(0, self.secret_id)
        self.secret_id_entry.pack(fill=tk.X, pady=(0, 5))
         
        tk.Label(self.api_frame, text="SecretKey:").pack(anchor=tk.W, pady=(0, 2))
        self.secret_key_entry = tk.Entry(self.api_frame, show="*")
        self.secret_key_entry.insert(0, self.secret_key)
        self.secret_key_entry.pack(fill=tk.X, pady=(0, 5))
         
        # File selection button
        self.select_btn = tk.Button(self.right_panel, text="选择图片", command=self.select_images)
        self.select_btn.pack(fill=tk.X, pady=10, padx=5)

        # Recognition options
        self.options_frame = tk.LabelFrame(self.right_panel, text="识别选项", padx=10, pady=10)
        self.options_frame.pack(fill=tk.X, pady=5, padx=5)
         
        self.rename_var = tk.IntVar(value=1)
        tk.Checkbutton(self.options_frame, text="自动重命名文件", variable=self.rename_var).pack(anchor=tk.W, pady=2)
         
        self.suffix_var = tk.IntVar(value=0)
        tk.Checkbutton(self.options_frame, text="添加序号后缀", variable=self.suffix_var).pack(anchor=tk.W, pady=2)
         
        # Keyword settings
        self.keywords_frame = tk.LabelFrame(self.right_panel, text="关键字设置(按顺序组合)", padx=10, pady=10)
        self.keywords_frame.pack(fill=tk.X, pady=5, padx=5)
         
        tk.Label(self.keywords_frame, text="用逗号分隔,如: 编号,名称,日期").pack(anchor=tk.W, pady=(0, 5))
        self.keywords_entry = tk.Entry(self.keywords_frame)
        self.keywords_entry.insert(0, self.keywords)
        self.keywords_entry.pack(fill=tk.X, pady=(0, 5))
         
        # Progress bar
        self.progress = ttk.Progressbar(self.right_panel, orient=tk.HORIZONTAL, mode='determinate')
        self.progress.pack(fill=tk.X, pady=10, padx=5)

        # Action button
        self.process_btn = tk.Button(self.right_panel, text="识别并重命名", command=self.start_processing)
        self.process_btn.pack(fill=tk.X, pady=10, padx=5)

        # Result display
        self.result_frame = tk.LabelFrame(self.right_panel, text="识别结果", padx=10, pady=10)
        self.result_frame.pack(fill=tk.BOTH, expand=True, pady=5, padx=5)
         
        self.result_text = tk.Text(self.result_frame, height=10, wrap=tk.WORD)
        scrollbar = tk.Scrollbar(self.result_frame)
        scrollbar.pack(side=tk.RIGHT, fill=tk.Y)
        self.result_text.pack(fill=tk.BOTH, expand=True)
        self.result_text.config(yscrollcommand=scrollbar.set)
        scrollbar.config(command=self.result_text.yview)
         
        # Navigation buttons
        self.nav_frame = tk.Frame(self.right_panel)
        self.nav_frame.pack(fill=tk.X, pady=10, padx=5)
         
        self.prev_btn = tk.Button(self.nav_frame, text="上一张", command=self.prev_image)
        self.prev_btn.pack(side=tk.LEFT, expand=True, padx=2)
         
        self.next_btn = tk.Button(self.nav_frame, text="下一张", command=self.next_image)
        self.next_btn.pack(side=tk.RIGHT, expand=True, padx=2)
         
        # Status bar
        self.status_var = tk.StringVar()
        self.status_var.set("就绪")
        self.status_bar = tk.Label(self.root, textvariable=self.status_var, bd=1, relief=tk.SUNKEN, anchor=tk.W)
        self.status_bar.pack(side=tk.BOTTOM, fill=tk.X)
     
    def create_replace_widgets(self):
        """Create the widgets for the rename tab."""
        # Control panel at the top
        control_frame = tk.Frame(self.replace_frame, padx=15, pady=15)
        control_frame.pack(fill=tk.X)

        # File selection button
        self.replace_select_btn = tk.Button(control_frame, text="选择文件", command=self.select_replace_files)
        self.replace_select_btn.pack(side=tk.LEFT, padx=5)
         
        # Keyword input
        tk.Label(control_frame, text="关键词:").pack(side=tk.LEFT, padx=5)
        self.keyword_entry = tk.Entry(control_frame, width=20)
        self.keyword_entry.pack(side=tk.LEFT, padx=5)
        self.keyword_entry.insert(0, "名称")
         
        # New name input
        tk.Label(control_frame, text="新名称:").pack(side=tk.LEFT, padx=5)
        self.new_name_entry = tk.Entry(control_frame, width=30)
        self.new_name_entry.pack(side=tk.LEFT, padx=5)
         
        # Option to append a number
        self.replace_number_var = tk.IntVar(value=1)
        tk.Checkbutton(control_frame, text="添加编号", variable=self.replace_number_var).pack(side=tk.LEFT, padx=5)

        # Starting number
        tk.Label(control_frame, text="起始:").pack(side=tk.LEFT, padx=5)
        self.replace_start_num = tk.Spinbox(control_frame, from_=1, to=9999, width=5)
        self.replace_start_num.pack(side=tk.LEFT, padx=5)

        # Number of digits
        tk.Label(control_frame, text="位数:").pack(side=tk.LEFT, padx=5)
        self.replace_digits_num = tk.Spinbox(control_frame, from_=1, to=5, width=3)
        self.replace_digits_num.pack(side=tk.LEFT, padx=5)
        self.replace_digits_num.delete(0, tk.END)
        self.replace_digits_num.insert(0, "3")
         
        # Rename button
        self.replace_btn = tk.Button(control_frame, text="批量重命名", command=self.start_replace_renaming)
        self.replace_btn.pack(side=tk.LEFT, padx=5)

        # Display area in the middle
        display_frame = tk.Frame(self.replace_frame)
        display_frame.pack(fill=tk.BOTH, expand=True, padx=15, pady=10)

        # Image preview on the left
        self.replace_image_frame = tk.LabelFrame(display_frame, text="图片预览", width=400, height=400)
        self.replace_image_frame.pack_propagate(False)
        self.replace_image_frame.pack(side=tk.LEFT, fill=tk.BOTH, expand=True, padx=(0, 10))
         
        self.replace_image_label = tk.Label(self.replace_image_frame)
        self.replace_image_label.pack(fill=tk.BOTH, expand=True)
         
        # File list on the right
        list_frame = tk.LabelFrame(display_frame, text="文件列表 (共0个文件)", width=300)
        list_frame.pack_propagate(False)
        list_frame.pack(side=tk.RIGHT, fill=tk.BOTH)
         
        # Parent the scrollbar to the frame (not the listbox) so it does not cover list items
        scrollbar = tk.Scrollbar(list_frame)
        scrollbar.pack(side=tk.RIGHT, fill=tk.Y)

        self.replace_file_listbox = tk.Listbox(list_frame, selectmode=tk.SINGLE)
        self.replace_file_listbox.pack(side=tk.LEFT, fill=tk.BOTH, expand=True)
        self.replace_file_listbox.bind('<<ListboxSelect>>', self.on_replace_file_select)
        self.replace_file_listbox.config(yscrollcommand=scrollbar.set)
        scrollbar.config(command=self.replace_file_listbox.yview)
         
        # Navigation buttons
        nav_frame = tk.Frame(self.replace_frame, pady=10)
        nav_frame.pack(fill=tk.X, padx=15)
         
        self.replace_prev_btn = tk.Button(nav_frame, text="上一个", command=self.replace_prev_file)
        self.replace_prev_btn.pack(side=tk.LEFT, padx=5)
         
        self.replace_next_btn = tk.Button(nav_frame, text="下一个", command=self.replace_next_file)
        self.replace_next_btn.pack(side=tk.LEFT, padx=5)
         
        # Progress bar
        self.replace_progress = ttk.Progressbar(self.replace_frame, orient=tk.HORIZONTAL, mode='determinate')
        self.replace_progress.pack(fill=tk.X, padx=15, pady=10)

        # Status bar
        self.replace_status_var = tk.StringVar()
        self.replace_status_var.set("就绪")
        status_bar = tk.Label(self.replace_frame, textvariable=self.replace_status_var, bd=1, relief=tk.SUNKEN, anchor=tk.W)
        status_bar.pack(fill=tk.X, padx=15, pady=5)
 
    def on_ocr_mode_change(self):
        """Handle a change of OCR mode."""
        selected_mode = OcrMode(self.ocr_mode_var.get())
        if selected_mode != self.ocr_mode:
            self.ocr_mode = selected_mode
            self.save_config()
             
            # Initialize PaddleOCR lazily when switching to it for the first time
            if self.ocr_mode == OcrMode.PADDLE and self.paddle_ocr is None and PADDLEOCR_AVAILABLE:
                self.initialize_paddle_ocr()
     
    def select_images(self):
        """Select image files."""
        if self.processing:
            messagebox.showwarning("警告", "正在处理中,请稍后再选择图片")
            return
             
        files = filedialog.askopenfilenames(
            title="选择图片文件",
            filetypes=[("图片文件", "*.jpg *.jpeg *.png *.bmp *.gif"), ("所有文件", "*.*")]
        )
         
        if files:
            self.image_files = list(files)
            self.current_index = 0
            self.show_current_image()
            self.status_var.set(f"已选择 {len(self.image_files)} 张图片")
            self.result_text.delete("1.0", tk.END)
 
    def show_current_image(self):
        """Show the current image."""
        if not self.image_files:
            return
             
        try:
            image_path = self.image_files[self.current_index]
            img = Image.open(image_path)
             
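            # Scale the image down to fit the preview area
            max_size = (550, 500)
            img.thumbnail(max_size, Image.LANCZOS)

            photo = ImageTk.PhotoImage(img)
            self.image_label.config(image=photo)
            self.image_label.image = photo  # keep a reference so the image is not garbage-collected

            # Update the status bar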
            self.status_var.set(f"图片 {self.current_index + 1}/{len(self.image_files)}: {os.path.basename(image_path)}")
        except Exception as e:
            messagebox.showerror("错误", f"无法加载图片: {str(e)}")
 
    def prev_image(self):
        """Show the previous image."""
        if self.processing:
            return
        if self.image_files and self.current_index > 0:
            self.current_index -= 1
            self.show_current_image()
 
    def next_image(self):
        """Show the next image."""
        if self.processing:
            return
        if self.image_files and self.current_index < len(self.image_files) - 1:
            self.current_index += 1
            self.show_current_image()
 
    def start_processing(self):
        """Start processing the selected images."""
        if self.processing:
            return
             
        if not self.image_files:
            messagebox.showwarning("警告", "请先选择图片文件")
            return
             
        self.secret_id = self.secret_id_entry.get().strip()
        self.secret_key = self.secret_key_entry.get().strip()
        self.keywords = self.keywords_entry.get().strip()
         
        # Tencent OCR mode requires API credentials
        if self.ocr_mode == OcrMode.TENCENT:
            if not self.secret_id or not self.secret_key:
                messagebox.showwarning("警告", "请输入腾讯云SecretId和SecretKey")
                return
                 
        if not self.keywords:
            messagebox.showwarning("警告", "请输入至少一个关键字")
            return
             
        # Disable the buttons to prevent repeated clicks
        self.processing = True
        self.select_btn.config(state=tk.DISABLED)
        self.process_btn.config(state=tk.DISABLED)
        self.prev_btn.config(state=tk.DISABLED)
        self.next_btn.config(state=tk.DISABLED)

        # Reset the progress bar
        self.progress["value"] = 0
        self.progress["maximum"] = len(self.image_files)

        # Clear previous results
        self.result_text.delete("1.0", tk.END)

        # Process in a background thread so the UI stays responsive
        Thread(target=self.process_images, daemon=True).start()
 
    def recognize_text(self, image_path):
        """Dispatch to the recognizer for the current OCR mode."""
        if self.ocr_mode == OcrMode.TENCENT:
            return self.recognize_text_with_tencent_ocr(image_path)
        elif self.ocr_mode == OcrMode.PADDLE and self.paddle_ocr is not None:
            return self.recognize_text_with_paddle_ocr(image_path)
        else:
            raise Exception("当前OCR模式不可用")
 
    def recognize_text_with_paddle_ocr(self, image_path):
        """Recognize the text in an image with PaddleOCR."""
        try:
            result = self.paddle_ocr.ocr(image_path, cls=True)
            lines = []

            # Each entry of the result may be None when no text was found
            for line in result or []:
                for detection in line or []:
                    # detection[1] is the (text, confidence) pair
                    if detection and detection[1]:
                        lines.append(detection[1][0])

            return "\n".join(lines).strip()
        except Exception as e:
            raise Exception(f"PaddleOCR识别失败: {str(e)}") from e
 
    def recognize_text_with_tencent_ocr(self, image_path):
        """Recognize the text in an image with the Tencent Cloud OCR API."""
        try:
            # Read the image and encode it as Base64
            with open(image_path, "rb") as image_file:
                image_data = image_file.read()
                image_base64 = base64.b64encode(image_data).decode('utf-8')
             
            # Tencent Cloud OCR API parameters
            action = "GeneralBasicOCR"
            region = "ap-guangzhou"
            endpoint = "ocr.tencentcloudapi.com"
            service = "ocr"
            version = "2018-11-19"
            algorithm = "TC3-HMAC-SHA256"
             
            # Current timestamp and the matching UTC date
            timestamp = int(time.time())
            date = time.strftime("%Y-%m-%d", time.gmtime(timestamp))
             
            # ************* Step 1: build the canonical request *************
            http_request_method = "POST"
            canonical_uri = "/"
            canonical_querystring = ""
            canonical_headers = "content-type:application/json; charset=utf-8\n" + f"host:{endpoint}\n"
            signed_headers = "content-type;host"
             
            payload = {
                "ImageBase64": image_base64,
                "LanguageType": "auto"
            }
            payload_str = json.dumps(payload)
             
            hashed_request_payload = hashlib.sha256(payload_str.encode('utf-8')).hexdigest()
             
            canonical_request = (http_request_method + "\n" +
                               canonical_uri + "\n" +
                               canonical_querystring + "\n" +
                               canonical_headers + "\n" +
                               signed_headers + "\n" +
                               hashed_request_payload)
             
            # ************* Step 2: build the string to sign *************
            credential_scope = date + "/" + service + "/" + "tc3_request"
            hashed_canonical_request = hashlib.sha256(canonical_request.encode('utf-8')).hexdigest()
             
            string_to_sign = (algorithm + "\n" +
                            str(timestamp) + "\n" +
                            credential_scope + "\n" +
                            hashed_canonical_request)
             
            # ************* Step 3: compute the signature *************
            secret_date = hmac.new(("TC3" + self.secret_key).encode('utf-8'),
                                 date.encode('utf-8'), hashlib.sha256).digest()
            secret_service = hmac.new(secret_date, service.encode('utf-8'), hashlib.sha256).digest()
            secret_signing = hmac.new(secret_service, "tc3_request".encode('utf-8'), hashlib.sha256).digest()
            signature = hmac.new(secret_signing, string_to_sign.encode('utf-8'), hashlib.sha256).hexdigest()
             
            # ************* Step 4: build the Authorization header *************
            authorization = (algorithm + " " +
                            "Credential=" + self.secret_id + "/" + credential_scope + ", " +
                            "SignedHeaders=" + signed_headers + ", " +
                            "Signature=" + signature)
             
            # ************* Send the request *************
            headers = {
                "Authorization": authorization,
                "Content-Type": "application/json; charset=utf-8",
                "Host": endpoint,
                "X-TC-Action": action,
                "X-TC-Version": version,
                "X-TC-Timestamp": str(timestamp),
                "X-TC-Region": region
            }
             
            response = requests.post(f"https://{endpoint}", headers=headers, data=payload_str, timeout=30)
             
            # Parse the body; a non-JSON body (e.g. a gateway error page) raises ValueError
            try:
                result = response.json()
            except ValueError:
                raise Exception(f"腾讯OCR API请求错误: HTTP {response.status_code} 错误")

            # The API reports failures (bad signature, quota, ...) in Response.Error,
            # usually with HTTP 200, so check the body and not only the status code
            error = result.get("Response", {}).get("Error")
            if error or response.status_code != 200:
                error_msg = (error or {}).get("Message", f"HTTP {response.status_code} 错误")
                raise Exception(f"腾讯OCR API请求错误: {error_msg}")

            # Collect the recognized text, one detection per line
            text_detections = result.get("Response", {}).get("TextDetections", [])
            recognized_text = "\n".join(
                detection.get("DetectedText", "") for detection in text_detections
            )

            return recognized_text.strip()

        except requests.exceptions.RequestException as e:
            error_msg = str(e)
            if getattr(e, "response", None) is not None:
                try:
                    error_data = e.response.json()
                    error_msg = error_data.get("Response", {}).get("Error", {}).get("Message", error_msg)
                except ValueError:
                    pass
            raise Exception(f"API请求失败: {error_msg}")
        except Exception as e:
            raise Exception(f"识别过程中出错: {str(e)}")
 
    def extract_keyword_contents(self, text):
        """Extract the text following each keyword, in the configured keyword order (core feature)."""
        keywords = [kw.strip() for kw in self.keywords.split(',') if kw.strip()]
        extracted_contents = []

        for keyword in keywords:  # process strictly in the configured order
            index = text.find(keyword)
            if index != -1:
                content_start = index + len(keyword)
                # The nearest following keyword occurrence marks where this value ends
                next_pos = None
                for kw in keywords:
                    pos = text.find(kw, content_start)
                    if pos != -1 and (next_pos is None or pos < next_pos):
                        next_pos = pos

                # Slicing with next_pos=None runs to the end of the text
                content = text[content_start:next_pos].strip()
                # Strip one leading separator (full-width colon, ASCII colon, or space)
                for sep in [":", ":", " "]:
                    if content.startswith(sep):
                        content = content[len(sep):].strip()
                        break
                # Keep only the first line
                extracted_contents.append(content.split('\n')[0].strip())
            else:
                extracted_contents.append("")  # not found: leave empty but keep the position

        return extracted_contents
 
    def process_images(self):
        """Process all selected images (runs in a worker thread)."""
        try:
            for i, image_path in enumerate(self.image_files):
                # Update progress
                self.root.after(0, lambda v=i+1: self.progress.configure(value=v))
                self.root.after(0, self.status_var.set,
                               f"正在处理 {i+1}/{len(self.image_files)}: {os.path.basename(image_path)}")
                 
                # Recognize the text in the image
                try:
                    recognized_text = self.recognize_text(image_path)
                    if not recognized_text:
                        recognized_text = "未识别到文字"
                     
                    # Show the full recognition result
                    self.root.after(0, self.result_text.insert, tk.END,
                                   f"{os.path.basename(image_path)}:\n{recognized_text}\n\n")
                     
                    # Rename if requested
                    if self.rename_var.get():
                        # Extract the text following each keyword
                        keyword_contents = self.extract_keyword_contents(recognized_text)

                        # Drop empty values and join the rest
                        filtered_contents = [c for c in keyword_contents if c]
                        new_name = "_".join(filtered_contents) if filtered_contents else "未找到关键字内容"
                         
                        if new_name:
                            new_name = self.generate_valid_filename(new_name)
                            if self.suffix_var.get():
                                new_name = f"{new_name}_{i+1:03d}"
                             
                            new_path = self.rename_file(image_path, new_name)
                            self.image_files[i] = new_path  # update the path in the file list
                            self.root.after(0, self.result_text.insert, tk.END,
                                          f"已重命名为: {os.path.basename(new_path)}\n\n")
                 
                except Exception as e:
                    self.logger.error(f"处理图片时出错: {str(e)}")
                    self.root.after(0, self.result_text.insert, tk.END,
                                  f"处理 {os.path.basename(image_path)} 时出错: {str(e)}\n\n")
                 
                # Refresh the preview if this is the image currently shown
                if i == self.current_index:
                    self.root.after(0, self.show_current_image)
             
            self.root.after(0, self.status_var.set, "处理完成")
             
            # 保存处理后的文件夹路径
            if self.image_files:
                folder_path = os.path.dirname(self.image_files[0])
                 
                # Show the completion dialog, then open the folder once the user clicks OK
                def show_completion_and_open_folder():
                    messagebox.showinfo("处理完成", f"已处理 {len(self.image_files)} 个文件\n点击确定打开文件夹")
                    self.open_folder(folder_path)
                 
                self.root.after(0, show_completion_and_open_folder)
            else:
                self.root.after(0, messagebox.showinfo, "处理完成", f"已处理 {len(self.image_files)} 个文件")
             
        except Exception as e:
            self.logger.error(f"处理过程中发生错误: {str(e)}")
            self.root.after(0, messagebox.showerror, "错误", f"处理过程中发生错误: {str(e)}")
             
        finally:
            # 重新启用按钮
            self.processing = False
            self.root.after(0, self.select_btn.config, {'state': tk.NORMAL})
            self.root.after(0, self.process_btn.config, {'state': tk.NORMAL})
            self.root.after(0, self.prev_btn.config, {'state': tk.NORMAL})
            self.root.after(0, self.next_btn.config, {'state': tk.NORMAL})
 
    def generate_valid_filename(self, text, max_length=100):
        """从识别文本生成有效的文件名"""
        if not text:
            return None
         
        # 移除非法字符
        invalid_chars = '<>:"/\\|?*\n\r\t'
        for char in invalid_chars:
            text = text.replace(char, '')
         
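        # Trim surrounding whitespace first, then replace inner spaces with underscores
        # (stripping after the replacement would no longer remove any spaces)
        text = text.strip().replace(' ', '_')

        # Truncate to the maximum length
        if len(text) > max_length:
            text = text[:max_length]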
         
        return text if text else "unnamed"
 
    def rename_file(self, old_path, new_name):
        """重命名文件"""
        dir_name = os.path.dirname(old_path)
        ext = os.path.splitext(old_path)[1]
         
        # 确保文件名唯一
        counter = 1
        base_new_name = new_name
        while True:
            new_path = os.path.join(dir_name, f"{base_new_name}{ext}")
            if not os.path.exists(new_path):
                break
            base_new_name = f"{new_name}_{counter}"
            counter += 1
         
        os.rename(old_path, new_path)
        return new_path
 
    def select_replace_files(self):
        """选择要重命名的文件"""
        if self.processing:
            messagebox.showwarning("警告", "正在处理中,请稍后再选择文件")
            return
             
        files = filedialog.askopenfilenames(title="选择要重命名的文件")
        if files:
            self.replace_files_list = list(files)
            self.replace_current_index = 0
            self.update_replace_file_list()
            self.show_replace_current_file()
            self.replace_status_var.set(f"已选择 {len(self.replace_files_list)} 个文件")
     
    def update_replace_file_list(self):
        """更新文件列表框"""
        self.replace_file_listbox.delete(0, tk.END)
        for file in self.replace_files_list:
            self.replace_file_listbox.insert(tk.END, os.path.basename(file))
        # 更新文件列表标题
        for child in self.replace_frame.winfo_children():
            if isinstance(child, tk.LabelFrame) and child.cget("text").startswith("文件列表"):
                child.config(text=f"文件列表 (共{len(self.replace_files_list)}个文件)")
                break
     
    def show_replace_current_file(self):
        """显示当前选中的文件"""
        if not self.replace_files_list:
            return
             
        file_path = self.replace_files_list[self.replace_current_index]
         
        # 如果是图片,显示预览
        if file_path.lower().endswith(('.png', '.jpg', '.jpeg', '.bmp', '.gif')):
            try:
                img = Image.open(file_path)
                img.thumbnail((400, 400))
                photo = ImageTk.PhotoImage(img)
                self.replace_image_label.config(image=photo)
                self.replace_image_label.image = photo
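            except Exception:
                # Preview failed: clear the label (Tk expects '' here; image=None is ignored)
                self.replace_image_label.config(image='')
                self.replace_image_label.image = None
        else:
            # Not an image: clear any previous preview
            self.replace_image_label.config(image='')
            self.replace_image_label.image = None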
         
        # 更新列表框选中状态
        self.replace_file_listbox.selection_clear(0, tk.END)
        self.replace_file_listbox.selection_set(self.replace_current_index)
        self.replace_file_listbox.see(self.replace_current_index)
         
        # 更新状态栏
        self.replace_status_var.set(f"文件 {self.replace_current_index + 1}/{len(self.replace_files_list)}: {os.path.basename(file_path)}")
     
    def on_replace_file_select(self, event):
        """当在列表框中选中文件时"""
        if not self.replace_files_list or self.processing:
            return
             
        selection = self.replace_file_listbox.curselection()
        if selection:
            self.replace_current_index = selection[0]
            self.show_replace_current_file()
     
    def replace_prev_file(self):
        """显示上一个文件"""
        if self.processing:
            return
        if self.replace_files_list and self.replace_current_index > 0:
            self.replace_current_index -= 1
            self.show_replace_current_file()
     
    def replace_next_file(self):
        """显示下一个文件"""
        if self.processing:
            return
        if self.replace_files_list and self.replace_current_index < len(self.replace_files_list) - 1:
            self.replace_current_index += 1
            self.show_replace_current_file()
     
    def start_replace_renaming(self):
        """开始批量重命名"""
        if self.processing:
            return
             
        if not self.replace_files_list:
            messagebox.showwarning("警告", "请先选择文件")
            return
             
        keyword = self.keyword_entry.get().strip()
        new_name = self.new_name_entry.get().strip()
         
        if not new_name:
            messagebox.showwarning("警告", "请输入新名称")
            return
             
        # 禁用按钮防止重复操作
        self.processing = True
        self.replace_select_btn.config(state=tk.DISABLED)
        self.replace_btn.config(state=tk.DISABLED)
        self.replace_prev_btn.config(state=tk.DISABLED)
        self.replace_next_btn.config(state=tk.DISABLED)
         
        # 重置进度条
        self.replace_progress["value"] = 0
        self.replace_progress["maximum"] = len(self.replace_files_list)
         
        # 在新线程中处理
        Thread(
            target=self.replace_rename_files,
            args=(keyword, new_name),
            daemon=True
        ).start()
     
    def replace_rename_files(self, keyword, new_name):
        """批量重命名文件"""
        try:
            new_files = []
            counter = int(self.replace_start_num.get())
            digits = int(self.replace_digits_num.get())
            add_number = self.replace_number_var.get()
             
            for i, file_path in enumerate(self.replace_files_list):
                # 更新进度
                self.root.after(0, lambda v=i+1: self.replace_progress.configure(value=v))
                self.root.after(0, self.replace_status_var.set,
                               f"正在处理 {i+1}/{len(self.replace_files_list)}: {os.path.basename(file_path)}")
                 
                # 获取文件信息
                dir_name = os.path.dirname(file_path)
                file_name, file_ext = os.path.splitext(os.path.basename(file_path))
                 
                # 构建新文件名
                if keyword:  # 如果有关键词,使用替换法
                    if keyword in file_name:
                        new_file_name = file_name.replace(keyword, new_name)
                    else:
                        new_file_name = f"{new_name}_{i+1}"
                else:  # No keyword: use the new name directly
                    new_file_name = new_name
                 
                # 如果需要添加编号
                if add_number:
                    new_file_name = f"{new_file_name}_{counter:0{digits}d}"
                    counter += 1
                 
                # 确保文件名唯一
                unique_counter = 1
                base_new_name = new_file_name
                while True:
                    new_path = os.path.join(dir_name, f"{base_new_name}{file_ext}")
                    if not os.path.exists(new_path):
                        break
                    base_new_name = f"{new_file_name}_{unique_counter}"
                    unique_counter += 1
                 
                # 重命名文件
                os.rename(file_path, new_path)
                new_files.append(new_path)
                 
                # 更新显示
                self.root.after(0, self.replace_file_listbox.delete, i)
                self.root.after(0, self.replace_file_listbox.insert, i, os.path.basename(new_path))
                 
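                # If this is the file currently shown, update its entry and refresh the preview
                if i == self.replace_current_index:
                    # Bind index and path now; the loop may reassign new_path before the callback runs
                    self.root.after(0, lambda idx=i, path=new_path: self.replace_files_list.__setitem__(idx, path))
                    self.root.after(0, self.show_replace_current_file)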
             
            self.replace_files_list = new_files
            self.root.after(0, self.replace_status_var.set, f"重命名完成,共处理 {len(self.replace_files_list)} 个文件")
             
            # 保存处理后的文件夹路径
            if self.replace_files_list:
                folder_path = os.path.dirname(self.replace_files_list[0])
                 
                # Show the completion dialog, then open the folder once the user clicks OK
                def show_completion_and_open_folder():
                    messagebox.showinfo("重命名完成", f"已处理 {len(self.replace_files_list)} 个文件\n点击确定打开文件夹")
                    self.open_folder(folder_path)
                 
                self.root.after(0, show_completion_and_open_folder)
            else:
                self.root.after(0, messagebox.showinfo, "重命名完成", f"已处理 {len(self.replace_files_list)} 个文件")
             
        except Exception as e:
            self.root.after(0, messagebox.showerror, "错误", f"重命名过程中出错: {str(e)}")
            self.root.after(0, self.replace_status_var.set, "重命名过程中出错")
             
        finally:
            # 重新启用按钮
            self.processing = False
            self.root.after(0, self.replace_select_btn.config, {'state': tk.NORMAL})
            self.root.after(0, self.replace_btn.config, {'state': tk.NORMAL})
            self.root.after(0, self.replace_prev_btn.config, {'state': tk.NORMAL})
            self.root.after(0, self.replace_next_btn.config, {'state': tk.NORMAL})
 
    def open_folder(self, folder_path):
        """打开指定文件夹"""
        try:
            if os.path.exists(folder_path):
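                # Open the folder with the system's default file manager
                if os.name == 'nt':  # Windows
                    os.startfile(folder_path)
                elif os.uname().sysname == 'Darwin':  # macOS
                    subprocess.call(['open', folder_path])
                else:  # Linux and other Unix-like systems
                    subprocess.call(['xdg-open', folder_path])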
            else:
                messagebox.showerror("错误", f"文件夹不存在: {folder_path}")
        except Exception as e:
            self.logger.error(f"打开文件夹时出错: {str(e)}")
            messagebox.showerror("错误", f"打开文件夹时出错: {str(e)}")
 
    def bind_shortcuts(self):
        """绑定快捷键"""
        # 文件操作快捷键
        self.root.bind('<Control-o>', lambda e: self.select_image())
        self.root.bind('<Control-s>', lambda e: self.save_result())
         
        # OCR操作快捷键
        self.root.bind('<Control-r>', lambda e: self.recognize_text())
        self.root.bind('<Control-c>', lambda e: self.copy_text())
         
        # 重命名操作快捷键
        self.root.bind('<Control-f>', lambda e: self.select_replace_files())
        self.root.bind('<Control-b>', lambda e: self.batch_rename())
         
        # 标签页切换快捷键
        self.root.bind('<Control-1>', lambda e: self.notebook.select(0))
        self.root.bind('<Control-2>', lambda e: self.notebook.select(1))
         
        # 配置快捷键
        self.root.bind('<Control-p>', lambda e: self.show_ocr_config())
 
    def handle_drop(self, event):
        """处理文件拖放"""
        file_path = event.data
        # 移除可能的大括号和引号
        file_path = file_path.strip('{}').strip('"')
         
        # 检查文件类型
        if file_path.lower().endswith(('.png', '.jpg', '.jpeg', '.bmp', '.gif')):
            self.image_path = file_path
            self.display_image()
            self.drop_label.place_forget()  # 隐藏拖放提示
        else:
            messagebox.showerror("错误", "请拖放图片文件(支持 PNG、JPG、JPEG、BMP、GIF 格式)")
             
    def display_image(self):
        """显示图片"""
        if self.image_path:
            try:
                # 打开并调整图片大小
                image = Image.open(self.image_path)
                # 计算调整后的大小,保持宽高比
                display_size = (400, 400)
                image.thumbnail(display_size, Image.Resampling.LANCZOS)
                 
                # 转换为PhotoImage
                photo = ImageTk.PhotoImage(image)
                 
                # 更新图片显示
                self.image_label.configure(image=photo)
                self.image_label.image = photo  # 保持引用
                 
                # 更新文件名显示
                self.filename_label.configure(text=f"文件名: {os.path.basename(self.image_path)}")
                 
                # 启用识别按钮
                self.recognize_button.configure(state='normal')
                 
            except Exception as e:
                messagebox.showerror("错误", f"无法加载图片: {str(e)}")
                self.image_path = None
                self.image_label.configure(image='')
                self.filename_label.configure(text="文件名: ")
                self.recognize_button.configure(state='disabled')
 
    def initialize_paddle_ocr(self):
        """初始化PaddleOCR"""
        if PADDLEOCR_AVAILABLE:
            try:
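                # Tune the PaddleOCR parameters for better recognition accuracy
                self.paddle_ocr = PaddleOCR(
                    use_angle_cls=True,  # use the text-direction classifier
                    lang="ch",  # Chinese recognition
                    det_model_dir=None,  # default detection model
                    rec_model_dir=None,  # default recognition model
                    cls_model_dir=None,  # default classification model
                    use_gpu=False,  # run on CPU
                    enable_mkldnn=True,  # enable MKL-DNN acceleration
                    det_db_thresh=0.3,  # text detection threshold
                    det_db_box_thresh=0.5,  # detection box threshold
                    det_db_unclip_ratio=1.6,  # detection box expansion ratio
                    max_batch_size=10,  # maximum batch size
                    use_dilation=False,  # no dilation
                    det_db_score_mode="fast",  # fast scoring mode
                    drop_score=0.5,  # confidence threshold
                    rec_char_dict_path=None,  # default character dictionary
                    show_log=False  # suppress log output
                )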
                self.logger.info("PaddleOCR初始化成功")
            except Exception as e:
                self.logger.error(f"PaddleOCR初始化失败: {str(e)}")
                messagebox.showerror("错误", f"PaddleOCR初始化失败: {str(e)}")
        else:
            self.logger.warning("PaddleOCR未安装")
            messagebox.showwarning("警告", "PaddleOCR未安装,请使用pip install paddlepaddle paddleocr安装")
 
    def save_ocr_config(self, ocr_mode_value, secret_id, secret_key, dialog):
        """保存OCR配置"""
        # 更新OCR模式
        new_mode = OcrMode(ocr_mode_value)
        if new_mode != self.ocr_mode:
            self.ocr_mode = new_mode
             
            # 如果切换到PaddleOCR且尚未初始化
            if self.ocr_mode == OcrMode.PADDLE and self.paddle_ocr is None and PADDLEOCR_AVAILABLE:
                self.initialize_paddle_ocr()
         
        # 更新API密钥
        self.secret_id = secret_id
        self.secret_key = secret_key
         
        # 保存配置
        self.save_config()
         
        # 更新界面
        self.ocr_mode_var.set(self.ocr_mode.value)
         
        # 关闭对话框
        dialog.destroy()
         
        # 显示成功消息
        messagebox.showinfo("成功", "OCR配置已保存")
 
    def save_keywords_config(self, keywords, dialog):
        """保存关键词配置"""
        self.keywords = keywords
        self.keywords_entry.delete(0, tk.END)
        self.keywords_entry.insert(0, self.keywords)
         
        # 保存配置
        self.save_config()
         
        # 关闭对话框
        dialog.destroy()
         
        # 显示成功消息
        messagebox.showinfo("成功", "关键词设置已保存")
 
    def show_help(self):
        """显示使用帮助"""
        help_text = f"""图片文字识别重命名工具 使用指南 (版本 {self.version})
 
【重要特性】文件名严格按设置的关键词顺序生成
 
一、基本功能
1. 支持腾讯云OCR和本地PaddleOCR两种识别方式
2. 根据关键词后的内容重命名文件
3. 文件名按您设置的关键词顺序组合
4. 支持批量文件重命名(带编号)
 
二、使用步骤
1. 选择OCR模式:
   - 腾讯云OCR(需要API密钥)
   - PaddleOCR(本地离线识别)
2. 设置关键词顺序(如:编号,名称,日期)
3. 点击"选择图片"添加文件
4. 勾选选项:
   - 自动重命名文件(必选)
   - 添加序号后缀(防重复)
5. 点击"识别并重命名"
 
三、关键词设置技巧
1. 顺序决定文件名结构(如设"日期,名称"则生成"20231116_产品.jpg")
2. 用逗号分隔多个关键词
3. 关键词应具有唯一性(避免误匹配)
4. 中英文均可(如:ID,name,日期)
 
四、重命名功能
1. 在"重命名"标签页中
2. 输入关键词和新名称
3. 可选择是否添加编号
4. 设置编号的起始值和位数
5. 点击"批量重命名"按钮
"""
        self.show_info_dialog("使用帮助", help_text)
 
    def check_update(self):
        """检查更新"""
        messagebox.showinfo("检查更新", f"当前已是最新版本 ({self.version})")
 
    def show_about(self):
        """显示关于对话框"""
        about_text = f"""图片文字识别重命名工具 v{self.version}
 
【核心功能】按设定顺序组合关键词生成文件名
 
开发者:Hfol85
联系方式:hfol85 @吾爱破解论坛
发布日期:{self.release_date}
 
技术栈:
- 腾讯云OCR API
- PaddleOCR 本地识别
- Python 3.x
- tkinter GUI界面
- 多线程处理
"""
        self.show_info_dialog("关于", about_text)
 
    def show_info_dialog(self, title, message):
        """显示信息对话框"""
        dialog = tk.Toplevel(self.root)
        dialog.title(title)
        dialog.resizable(True, True)
         
        # 设置对话框位置
        dialog.geometry(f"650x500+{self.root.winfo_x()+100}+{self.root.winfo_y()+100}")
         
        # 创建文本区域
        text_frame = tk.Frame(dialog, padx=15, pady=15)
        text_frame.pack(fill=tk.BOTH, expand=True, padx=10, pady=10)
         
        # Pack the scrollbar first so it keeps its strip on the right-hand side;
        # packing it after an expanding Text widget leaves it no room
        scrollbar = tk.Scrollbar(text_frame)
        scrollbar.pack(side=tk.RIGHT, fill=tk.Y)

        text = tk.Text(text_frame, wrap=tk.WORD, padx=10, pady=10,
                       yscrollcommand=scrollbar.set)
        text.insert(tk.END, message)
        text.config(state=tk.DISABLED)
        text.pack(side=tk.LEFT, fill=tk.BOTH, expand=True)
        scrollbar.config(command=text.yview)
         
        # 按钮区域
        btn_frame = tk.Frame(dialog, pady=10)
        btn_frame.pack(fill=tk.X, padx=10)
         
        close_btn = tk.Button(btn_frame, text="关闭", command=dialog.destroy)
        close_btn.pack()
         
        # 使对话框模态
        dialog.transient(self.root)
        dialog.grab_set()
        self.root.wait_window(dialog)
 
    def load_config(self):
        """从配置文件加载配置"""
        config = configparser.ConfigParser()
        if self.config_file.exists():
            try:
                config.read(self.config_file)
                self.secret_id = config.get('TENCENT', 'SecretId', fallback='')
                self.secret_key = config.get('TENCENT', 'SecretKey', fallback='')
                self.keywords = config.get('SETTINGS', 'Keywords', fallback='编号,名称,日期')
                 
                # 加载OCR模式设置
                ocr_mode = config.get('SETTINGS', 'OcrMode', fallback='tencent')
                if ocr_mode.lower() == 'paddle' and PADDLEOCR_AVAILABLE:
                    self.ocr_mode = OcrMode.PADDLE
                else:
                    self.ocr_mode = OcrMode.TENCENT
            except Exception as e:
                self.logger.error(f"加载配置文件失败: {str(e)}")
 
    def save_config(self):
        """保存配置到文件"""
        config = configparser.ConfigParser()
        config['TENCENT'] = {
            'SecretId': self.secret_id,
            'SecretKey': self.secret_key
        }
        config['SETTINGS'] = {
            'Keywords': self.keywords,
            'OcrMode': 'paddle' if self.ocr_mode == OcrMode.PADDLE else 'tencent'
        }
        try:
            with open(self.config_file, 'w') as f:
                config.write(f)
        except Exception as e:
            self.logger.error(f"保存配置文件失败: {str(e)}")
 
    def on_closing(self):
        """窗口关闭事件处理"""
        self.save_config()
        self.root.destroy()
 
if __name__ == "__main__":
    if TKDND_AVAILABLE:
        root = TkinterDnD.Tk()
    else:
        root = tk.Tk()
        messagebox.showwarning("警告",
            "未检测到tkinterdnd2库,拖放功能将不可用。\n"
            "如需使用拖放功能,请安装: pip install tkinterdnd2")
     
    # 检查PaddleOCR是否可用
    if not PADDLEOCR_AVAILABLE:
        messagebox.showwarning("警告",
            "未检测到PaddleOCR库,将仅支持腾讯OCR模式。\n"
            "如需使用本地PaddleOCR,请安装: pip install paddlepaddle paddleocr")
     
    app = ImageTextRenamerApp(root)
    root.mainloop()
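
The help text says the file name is assembled strictly in the order of the configured keywords (for example "编号,名称,日期"). The listing above calls `extract_keyword_contents`, but its body is not shown in this part of the post, so the following is only a minimal sketch of how such an extraction might work. The regular expression, the standalone function signature and the sample text are assumptions for illustration, not the author's actual implementation.

```python
import re


def extract_keyword_contents(text, keywords):
    """Return the text following each keyword, in the order the keywords are given.

    Hypothetical stand-in for the method of the same name used in the post.
    """
    contents = []
    for keyword in keywords:
        # Keyword, then optional separators (ASCII or full-width colon, whitespace),
        # then the next run of non-whitespace characters.
        match = re.search(re.escape(keyword) + r"[::\s]*(\S+)", text)
        contents.append(match.group(1) if match else "")
    return contents


def build_name(text, keywords):
    """Join the non-empty pieces with underscores, as the processing loop does."""
    parts = [part for part in extract_keyword_contents(text, keywords) if part]
    return "_".join(parts)


if __name__ == "__main__":
    sample = "编号:2023001 日期:20230403"
    print(build_name(sample, ["编号", "日期"]))
    print(build_name(sample, ["日期", "编号"]))
```

Swapping the keyword order swaps the order of the pieces in the resulting name, which is the behaviour the help text describes. A keyword that does not occur simply contributes nothing.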

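
`generate_valid_filename` and `rename_file` together guarantee that a generated name is legal on Windows and does not collide with an existing file. The sketch below restates that idea as two standalone functions so it can be tried outside the GUI. The names `make_valid_filename` and `unique_path` are made up for this example, and it only computes the target path without renaming anything.

```python
import os
import tempfile

# Characters removed from generated names (same set as in the listing above)
INVALID_CHARS = '<>:"/\\|?*\n\r\t'


def make_valid_filename(text, max_length=100):
    """Drop illegal characters, turn spaces into underscores and cap the length."""
    cleaned = "".join(ch for ch in text if ch not in INVALID_CHARS)
    cleaned = cleaned.strip().replace(" ", "_")[:max_length]
    return cleaned or "unnamed"


def unique_path(directory, base_name, ext):
    """Return a path in `directory` that does not exist yet, adding _1, _2, ... if needed."""
    candidate = os.path.join(directory, f"{base_name}{ext}")
    counter = 1
    while os.path.exists(candidate):
        candidate = os.path.join(directory, f"{base_name}_{counter}{ext}")
        counter += 1
    return candidate


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        name = make_valid_filename("P1001: 无线 耳机?")
        first = unique_path(tmp, name, ".jpg")
        open(first, "w").close()  # simulate a file that already uses the name
        second = unique_path(tmp, name, ".jpg")
        print(os.path.basename(first), os.path.basename(second))
```

There is still a small window between the existence check and the actual rename in which another process could create the file, so this is sufficient for a single-user desktop tool but not a hard guarantee.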

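
The naming rule used on the "重命名" tab (`replace_rename_files`) may be easier to follow as a pure function. The following is a rough restatement of that rule only. `build_new_name` and its parameters are illustrative names, and all file system and GUI work is left out.

```python
def build_new_name(stem, keyword, new_name, index, number=None, digits=3):
    """Compute the new file stem for one file in a batch rename.

    stem     -- current file name without extension
    keyword  -- text to replace; an empty string means "use new_name as is"
    new_name -- replacement text or new base name
    index    -- zero-based position of the file in the batch
    number   -- running counter to append, or None to skip numbering
    digits   -- zero-padded width of the counter
    """
    if keyword:
        if keyword in stem:
            result = stem.replace(keyword, new_name)
        else:
            # Keyword not found: fall back to new_name plus the 1-based position
            result = f"{new_name}_{index + 1}"
    else:
        result = new_name
    if number is not None:
        result = f"{result}_{number:0{digits}d}"
    return result


if __name__ == "__main__":
    print(build_new_name("IMG_0001", "IMG", "invoice", 0))
    print(build_new_name("scan", "IMG", "invoice", 2))
    print(build_new_name("scan", "", "invoice", 0, number=7, digits=3))
```

Without a keyword and without numbering, every file gets the same stem, so the collision check that appends `_1`, `_2`, ... is what keeps the names distinct in that case.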
Release thread for the finished build (includes download links for the old and new versions): https://www.52pojie.cn/thread-2021137-1-1.html

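Recommended
 OP | hfol85 posted on 2025-4-5 09:52 | OP
hdx001 posted on 2025-4-5 09:22
Can this be used offline?

Not for now. I may put together an offline, locally deployed version later.
Recommended
 OP | hfol85 posted on 2025-4-6 18:32 | OP
hdx001 posted on 2025-4-5 09:58
Nice tool, but an online-only version can't be used at my workplace.

The source code of the dual-engine version, PaddleOCR (local recognition) plus Tencent Cloud OCR (online recognition), has been added to the original Baidu Netdisk share link. Download it and give it a try if you are interested. PaddlePaddle and PaddleOCR must be deployed locally before use.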
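2#
ov3r丶丶 posted on 2025-4-3 22:08
3#
tdyy posted on 2025-4-3 22:19
Thanks, OP. I'll give it a try.
4#
aguai2008 posted on 2025-4-3 22:25
Thanks, OP.
5#
xiaowuyou posted on 2025-4-3 22:27
I'll try it out, thanks.
6#
laoshizaoan posted on 2025-4-3 22:31
Thanks for sharing.
7#
crystalZ posted on 2025-4-3 22:34
Thanks for sharing.
8#
52PJ070 posted on 2025-4-3 23:33
Works fine. Thanks for sharing, OP!
9#
HHORT posted on 2025-4-3 23:48
Thanks, OP.
10#
sizhan19861117 posted on 2025-4-4 04:25
Thanks, this is exactly what I needed.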