文档 Experience Platform Intelligent Services 指南

文本存在和光学字符识别

Last update: Tue Jul 16 2024 00:00:00 GMT+0000 (Coordinated Universal Time)

创建对象：

用户

当给定图像时，文本存在/光学字符识别(OCR)服务可以指示图像中是否存在文本。如果存在文本，则OCR可以返回文本。

在本文档显示的示例请求中使用了以下图像：

示例图像

API格式

POST /services/v2/predict

请求

以下请求根据有效负载中提供的输入图像检查文本是否存在。有关显示的输入参数的更多信息，请参阅示例有效负载下方的表。

使用内联图像执行：

curl -w'\n' -i -X POST https://sensei.adobe.io/services/v2/predict \
-H 'Prefer: respond-async, wait=59' \
-H "x-api-key: $API_KEY" \
-H "content-type: multipart/form-data" \
-H "authorization: Bearer $API_TOKEN" \
-F file=@sample_image.png \
-F 'contentAnalyzerRequests={
  "sensei:name": "Feature:cintel-object-detection:Service-b9ace8b348b6433e9e7d82371aa16690",
  "sensei:invocation_mode": "asynchronous",
  "sensei:invocation_batch": false,
  "sensei:engines": [
    {
      "sensei:execution_info": {
        "sensei:engine": "Feature:cintel-object-detection:Service-b9ace8b348b6433e9e7d82371aa16690"
      },
      "sensei:inputs": {
        "documents": [
        {
          "sensei:multipart_field_name": "file",
          "dc:format": "image/jpg"
        }
        ]
      },
      "sensei:params": {
        "correct_with_dictionary": true,
        "min_probability": 0.2,
        "min_relevance": 0.01,
        "filter_with_dictionary": true
      },
      "sensei:outputs":{
        "result" : {
          "sensei:multipart_field_name" : "result",
          "dc:format": "application/json"
        }
      }
    }
  ]
}'

响应

成功的响应返回在请求中传递的每个图像的tags列表中检测到的文本。如果特定图像中没有文本，则is_text_present为0，tags为空列表。

[result0， result1， …]：每个输入文档的响应列表。每个结果都是一个带有键的命令：

request_element_id：此响应的输入文件对应的索引，0表示请求文档列表中的第一个图像，1表示下一个图像，依此类推。
标记：词典列表，每个词典有两个键：文本（从图像识别的单词）和相关性（计算为提取文本边界框区域相对于完整图像的部分）。 0.01将翻译为至少占图像1%的文本。
is_text_present： 0或1，具体取决于图像中是否存在文本。如果标记为0，则列表为空。

{
  "contentAnalyzerResponse": {
    "statuses": [
      {
        "sensei:engine": "Feature:cintel-object-detection:Service-b9ace8b348b6433e9e7d82371aa16690",
        "invocations": [
          {
            "sensei:outputs": {
              "result": {
                "sensei:multipart_field_name": "result",
                "dc:format": "application/json"
              }
            },
            "message": null,
            "status": "200"
          }
        ]
      }
    ],
    "request_id": "dttklFR7DPtMtEmjlRSx5BYP5WGg3tTx"
  },
  "result": [
    {
      "is_text_present": 1,
      "tags": [
        {
          "text": "yosemite",
          "relevance": 0.06
        }
      ],
      "request_element_id": 0
    }
  ]
}

请求

以下请求根据有效负载中提供的输入图像检查文本是否存在。有关显示的输入参数的更多信息，请参阅示例有效负载下方的表。

使用URL执行：

curl -w'\n' -i -X POST https://sensei.adobe.io/services/v2/predict \
-H 'Prefer: respond-async, wait=59' \
-H "x-api-key: $API_KEY" \
-H "content-type: multipart/form-data" \
-H "authorization: Bearer $API_TOKEN" \
-F 'contentAnalyzerRequests={
  "sensei:name": "Feature:cintel-object-detection:Service-b9ace8b348b6433e9e7d82371aa16690",
  "sensei:invocation_mode": "asynchronous",
  "sensei:invocation_batch": false,
  "sensei:engines": [
    {
      "sensei:execution_info": {
        "sensei:engine": "Feature:cintel-object-detection:Service-b9ace8b348b6433e9e7d82371aa16690"
      },
      "sensei:inputs": {
        "documents": [
        {
          "repo:path": <IMG_URL_PATH>,
          "sensei:repoType": "HTTP",
          "dc:format": "image/jpg"
        }
        ]
      },
      "sensei:params": {
        "correct_with_dictionary": true
      },
      "sensei:outputs":{
        "result" : {
          "sensei:multipart_field_name" : "result",
          "dc:format": "application/json"
        }
      }
    }
  ]
}'

{
  "contentAnalyzerResponse": {
    "statuses": [
      {
        "sensei:engine": "Feature:cintel-object-detection:Service-b9ace8b348b6433e9e7d82371aa16690",
        "invocations": [
          {
            "sensei:outputs": {
              "result": {
                "sensei:multipart_field_name": "result",
                "dc:format": "application/json"
              }
            },
            "message": null,
            "status": "200"
          }
        ]
      }
    ],
    "request_id": "ZbdhcK0JqS4Wg1wGdlEHGR3JOm530YNn"
  },
  "result": [
    {
      "is_text_present": 0,
      "tags": [],
      "request_element_id": 0
    }
  ]
}

属性

描述

必需

documents

JSON元素列表，列表中的每一项代表一个图像。作为此列表的一部分传递的任何参数都会覆盖在列表外部指定的相应列表元素的全局参数。

是

sensei:multipart_field_name

从中读取输入文件路径的field_name。

是

repo:path

指向图像资产的预签名URL。

是

sensei:repoType

"HTTP"（表示预签名的url）。

否

dc:format

输入图像的编码格式。图像编码仅允许使用图像格式，如jpeg、jpg、png和tiff。 dc：format与允许的格式匹配。

否

correct_with_dictionary

要不要用英语词典来纠正这些字词？如果未打开此功能，则可能会识别出非英语单词。默认值为True：已打开。) 请注意，打开词典时，没有必要总是得到一个英文单词。我们尝试更正它，但如果在一定编辑距离内不可能更正它，我们返回原始单词。

否

filter_with_dictionary

是否过滤单词，使其仅包含英语词典中的单词？如果启用此项，则返回的单词将始终属于大英语，它包含47万个单词。

否

min_probability

被识别的单词的最小概率是多少？服务只返回从图像中提取的概率大于min_probability的单词。默认值设置为0.2。

否

min_relevance

对于已识别的单词，最小相关性是多少？服务只返回从图像中提取的具有比min_relevance更大的相关性的单词。默认值设置为0.01。相关性计算为提取文本边界框的区域与完整图像相比的分数。 0.01将翻译为至少占图像1%的文本。

否

名称

数据类型

必需

默认

值

描述

repo:path

字符串

需要从中提取文本的图像预签名URL。

sensei:repoType

字符串

HTTPS

存储映像的存储库类型。

sensei:multipart_field_name

字符串

将图像作为多部分参数传递时，请使用此选项，而不是使用带预签名的url。

dc:format

字符串

是

"image/jpg"，
"image/jpeg"，
"image/png"，
"image/tiff"

在处理之前，将根据允许的输入编码类型检查图像编码。

recommendation-more-help

8959a20a-a58f-4057-9f82-870706c576e9