【Python】通过百度API实现图片的文本识别

kuruite 发表于 2021-6-16 13:22

本帖最后由 kuruite 于 2021-6-16 13:24 编辑

#!/usr/bin/env python3
# -*- coding: utf-8 -*-
"""
<百度云>:<文本识别>
<通过百度云api识别图片中的文字>
:copyright: (c) 2021 by debris.
:license: GPLv3, see LICENSE File for more details.
"""
import base64
import requests

'''
access_token 获取
'''
# ak、sk参数配置,百度api上获取API_KEY、SECRET_KEY
API_KEY = 'XXXXXX'
SECRET_KEY = 'XXXXXX'

# client_id 为官网获取的AK， client_secret 为官网获取的SK
host = 'https://aip.baidubce.com/oauth/2.0/token?grant_type=client_credentials&client_id=%s&client_secret=%s' % (
API_KEY, SECRET_KEY)
response = requests.get(host)
access_token = response.json()['access_token']

'''
通用文字识别（高精度版）
'''
#
request_url = "https://aip.baidubce.com/rest/2.0/ocr/v1/accurate_basic"
# 二进制方式打开图片文件，填自己目录以及图片名称
f = open('./img/WechatIMG15289.jpeg', 'rb')

img = base64.b64encode(f.read())
params = {"image": img}
# access_token = '[调用鉴权接口获取的token]'
request_url = request_url + "?access_token=" + access_token
headers = {'content-type': 'application/x-www-form-urlencoded'}
response = requests.post(request_url, data=params, headers=headers).json()
outinfo = response['words_result']

#将识别到的文本写入到文件中
fo = open("./info.txt", "w")
for x in outinfo:
# fo.write(x['words'])
print(x['words'],file=fo)
fo.close()

husky9527 发表于 2021-6-16 13:44

感谢分享可以学习一下

QingYi. 发表于 2021-6-16 14:06

建议放一下成果图

santus36 发表于 2021-6-16 14:15

python有个ocr库叫cnocr，识别效果还是挺好的
https://github.com/breezedeus/cnocr/

xixiaomeng 发表于 2021-6-16 16:04

下载小王子 发表于 2021-6-16 17:26

这个得好好学习下正实用

zheshiweihe 发表于 2021-6-16 17:52

百度这些api还是可以用的，识别率挺高

woleo 发表于 2021-6-16 17:55

ysztcn 发表于 2021-6-17 08:37

学习学习

nappywu 发表于 2021-6-18 09:52

不错，写的很好

页: [1] 2

吾爱破解 - 52pojie.cn's Archiver

【Python】通过百度API实现图片的文本识别