700字范文 > AI--调用百度OCR文字识别API进行图片文字识别

AI--调用百度OCR文字识别API进行图片文字识别

时间：2022-03-01 20:46:07

1 OCR文字识别

百度文字识别OCR提供多场景、多语种、高精度的文字检测与识别服务，多项ICDAR指标居世界第一。广泛适用于远程身份认证、文档电子化、金融保险、法律政务、财税报销、快递物流、交通出行、教育培训等场景，让应用看图识字，提升输入效率，优化用户体验，为企业降本增效。

2 如何调用

2.1 创建应用

在百度智能云控制台创建一个应用

每个应用都有产生独有的AppID、API Key、Secret Key

2.2 安装baidu-aip

pip install baidu-aip -i https://pypi.tuna./simple/

2.3 代码示例

#!/usr/bin/env python# *_* coding :UTF-8 *_*import osfrom aip import AipOcr# 定义常量, 自己申请的应用信息APP_ID = '24751112' # AppIDAPI_KEY = '497yL879Wa0QjIp65o04qfzn' # API KeySECRET_KEY = 'jgB76yC6hVSqqoFp34HgQRYFgGT2Zh0T' # Secret Keyclient = AipOcr(APP_ID, API_KEY, SECRET_KEY) # 初始化AipFace对象path = "F:/tmp/pics" # 读取图片all_picture_path = os.listdir(path) # 统计文件下图片个数file = open("F:/tmp/orc_info.txt", 'w', encoding='utf-8') # 打开你一会需要把信息写入的文件def get_file_content(picture_path):with open(picture_path, 'rb') as fp:return fp.read()options = {"language_type": "CHN_ENG", "detect_direction": "true", "detect_language": "true", "probability": "true"}# 迭代识别文件下的所有图片for path in all_picture_path:picture_path = os.path.join('F:', 'tmp', 'pics', path)print(picture_path)result = client.basicGeneral(get_file_content(picture_path), options) # 接收aip返回的识别结果reg_res = "识别结果:\n"print(result)if 'words_result' in result:for seg in result['words_result']:reg_res += seg['words']file.write(picture_path + "\n")file.write(reg_res + "\n")file.close()

可以根据实际情况，调用不同的方法，进行文字识别