Determining Whether Documents Are PDF/A-Compliant
You can determine whether a PDF document is PDF/A-compliant by using the Assembler service. PDF/A is an archival format meant for long-term preservation of a document’s content. The fonts are embedded within the document, and the file is uncompressed. As a result, a PDF/A document is typically larger than a standard PDF document. Also, a PDF/A document does not contain audio or video content.
The PDF/A-1 specification consists of two levels of conformance, namely A and B. The major difference between the two levels is the logical structure (accessibility) support, which is not required for conformance level B. Regardless of the conformance level, PDF/A-1 dictates that all fonts are embedded within the generated PDF/A document. At this time, only PDF/A-1b is supported in validation (and conversion).
For the purpose of this discussion, assume that the following DDX document is used.
<?xml version="1.0" encoding="UTF-8"?>
<DDX xmlns="http://ns.adobe.com/DDX/1.0/">
<DocumentInformation source="Loan.pdf" result="Loan_result.xml">
<PDFAValidation compliance="PDF/A-1b" resultLevel="Detailed" ignoreUnusedResources="true" allowCertificationSignatures="true" />
</DocumentInformation>
</DDX>
Within this DDX document, the DocumentInformation element instructs the Assembler service to return information about the input PDF document. Within the DocumentInformation element, the PDFAValidation element instructs the Assembler service to indicate whether the input PDF document is PDF/A-compliant.
The Assembler service returns information that specifies whether the input PDF document is PDF/A-compliant within an XML document that contains a PDFAConformance element. If the input PDF document is PDF/A-compliant, the value of the PDFAConformance element’s isCompliant attribute is true. If the PDF document is not PDF/A-compliant, the value of the PDFAConformance element’s isCompliant attribute is false.
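The check on the isCompliant attribute can be sketched in Java with the JDK’s built-in DOM parser. This is a minimal sketch, not part of the AEM Forms API: the class name PdfaResultReader and its method are hypothetical, and the code assumes only that the returned XML contains a PDFAConformance element with an isCompliant attribute, as described above. It matches the element by local name, so it makes no assumption about the namespace or the wrapper elements of the result.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper for illustration; not part of the AEM Forms API.
final class PdfaResultReader {

    // Returns true when the first PDFAConformance element has isCompliant="true".
    static boolean isCompliant(String resultXml) {
        Document doc;
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            factory.setNamespaceAware(true);
            doc = factory.newDocumentBuilder()
                    .parse(new InputSource(new StringReader(resultXml)));
        } catch (Exception e) {
            throw new IllegalArgumentException("Result is not well-formed XML", e);
        }
        // "*" matches the element in any namespace (or in none).
        NodeList matches = doc.getElementsByTagNameNS("*", "PDFAConformance");
        if (matches.getLength() == 0) {
            throw new IllegalArgumentException("No PDFAConformance element found");
        }
        Element conformance = (Element) matches.item(0);
        return Boolean.parseBoolean(conformance.getAttribute("isCompliant"));
    }
}
```

A caller would pass the content of the returned XML document as a string and branch on the boolean result.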
Because the DDX document contains the DocumentInformation element, the Assembler service returns XML data instead of a PDF document. That is, the Assembler service does not assemble or disassemble a PDF document; it returns information about the input PDF document within an XML document.
Summary of steps
To determine whether a PDF document is PDF/A-compliant, perform the following tasks:
- Include project files.
- Create a PDF Assembler client.
- Reference an existing DDX document.
- Reference a PDF document used to determine PDF/A compliancy.
- Set run-time options.
- Retrieve information about the PDF document.
- Save the returned XML document.
Include project files
Include the necessary files in your development project. If you are creating a client application by using Java, include the necessary JAR files. If you are using web services, ensure that you include the proxy files.
The following JAR files must be added to your project’s class path:
- adobe-livecycle-client.jar
- adobe-usermanager-client.jar
- adobe-assembler-client.jar
- adobe-utilities.jar (required if AEM Forms is deployed on JBoss)
- jbossall-client.jar (required if AEM Forms is deployed on JBoss)
If AEM Forms is deployed on a supported J2EE application server other than JBoss, you must replace the adobe-utilities.jar and jbossall-client.jar files with JAR files that are specific to the J2EE application server that AEM Forms is deployed on. For information about the location of all AEM Forms JAR files, see Including AEM Forms Java library files.
Create a PDF Assembler client
Before you can programmatically perform an Assembler operation, you must create an Assembler service client.
Reference an existing DDX document
A DDX document must be referenced to perform an Assembler service operation. To determine whether an input PDF document is PDF/A-compliant, ensure that the DDX document contains the PDFAValidation element within a DocumentInformation element. The PDFAValidation element instructs the Assembler service to return an XML document that specifies whether the input PDF document is PDF/A-compliant.
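Before sending a DDX document to the Assembler service, you may want to confirm that it has the required nesting. The following is a minimal sketch that uses only the JDK’s DOM parser; the class name DdxInspector and its method are hypothetical and are not part of the AEM Forms API. It reports whether any DocumentInformation element has a PDFAValidation element as a direct child, matching elements by local name regardless of namespace.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper for illustration; not part of the AEM Forms API.
final class DdxInspector {

    // True when a PDFAValidation element is a direct child of a DocumentInformation element.
    static boolean requestsPdfaValidation(String ddxXml) {
        Document doc;
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            factory.setNamespaceAware(true);
            doc = factory.newDocumentBuilder()
                    .parse(new InputSource(new StringReader(ddxXml)));
        } catch (Exception e) {
            throw new IllegalArgumentException("DDX is not well-formed XML", e);
        }
        NodeList infos = doc.getElementsByTagNameNS("*", "DocumentInformation");
        for (int i = 0; i < infos.getLength(); i++) {
            NodeList children = infos.item(i).getChildNodes();
            for (int j = 0; j < children.getLength(); j++) {
                Node child = children.item(j);
                if (child.getNodeType() == Node.ELEMENT_NODE
                        && "PDFAValidation".equals(child.getLocalName())) {
                    return true;
                }
            }
        }
        return false;
    }
}
```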
Reference a PDF document used to determine PDF/A compliancy
A PDF document must be referenced and passed to the Assembler service to determine whether the PDF document is PDF/A-compliant.
Set run-time options
You can set run-time options that control the behavior of the Assembler service while it performs a job. For example, you can set an option that instructs the Assembler service to continue processing a job if an error is encountered. For information about the run-time options that you can set, see the AssemblerOptionSpec class reference in AEM Forms API Reference.
Retrieve information about the PDF document
After you create the Assembler service client, reference the DDX document, reference the input PDF document, and set run-time options, you can invoke the invokeDDX operation. Because the DDX document contains the DocumentInformation element, the Assembler service returns XML data instead of a PDF document.
Save the returned XML document
The XML document that the Assembler service returns specifies whether the input PDF document is PDF/A-compliant. For example, if the input PDF document is not PDF/A-compliant, the Assembler service returns an XML document that contains the following element:
<PDFAConformance isCompliant="false" compliance="PDF/A-1b" resultLevel="Detailed" ignoreUnusedResources="true" allowCertificationSignatures="true">
Save the XML document as an XML file so that you can open the file and view the results.
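If the returned XML is already available as a byte array, saving it can be sketched with the JDK’s java.nio.file API. This is only a sketch under that assumption: the class name XmlResultSaver and its method are hypothetical, and how you obtain the bytes depends on the API you use to call the Assembler service.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper for illustration; not part of the AEM Forms API.
final class XmlResultSaver {

    // Writes the XML bytes to the target file, creating or overwriting it.
    static Path save(byte[] xmlBytes, Path target) {
        try {
            return Files.write(target, xmlBytes);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

The bytes are written unchanged, so the file keeps whatever encoding the XML declaration in the returned data states.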
See also
Determine whether a document is PDF/A compliant using the Java API
Determine whether a document is PDF/A compliant using the web service API
Determine whether a document is PDF/A compliant using the Java API
Determine whether a PDF document is PDF/A-compliant by using the Assembler Service API (Java):
- Include project files.
  Include client JAR files, such as adobe-assembler-client.jar, in your Java project’s class path.
- Create a PDF Assembler client.
  - Create a ServiceClientFactory object that contains connection properties.
  - Create an AssemblerServiceClient object by using its constructor and passing the ServiceClientFactory object.
- Reference an existing DDX document.
  - Create a java.io.FileInputStream object that represents the DDX document by using its constructor and passing a string value that specifies the location of the DDX file. To determine whether the PDF document is PDF/A-compliant, ensure that the DDX document contains the PDFAValidation element within a DocumentInformation element.
  - Create a com.adobe.idp.Document object by using its constructor and passing the java.io.FileInputStream object.
- Reference a PDF document used to determine PDF/A compliancy.
  - Create a java.io.FileInputStream object by using its constructor and passing the location of a PDF document that is used to determine PDF/A compliancy.
  - Create a com.adobe.idp.Document object by using its constructor and passing the java.io.FileInputStream object that contains the PDF document.
  - Create a java.util.Map object that is used to store the input PDF document by using a HashMap constructor.
  - Add an entry to the java.util.Map object by invoking its put method and passing the following arguments:
    - A string value that represents the key name. This value must match the value of the source attribute specified in the DDX document. For example, the value of the source attribute in the DDX document that is introduced in this section is Loan.pdf.
    - A com.adobe.idp.Document object that contains the input PDF document.
- Set run-time options.
  - Create an AssemblerOptionSpec object that stores run-time options by using its constructor.
  - Set run-time options to meet your business requirements by invoking a method that belongs to the AssemblerOptionSpec object. For example, to instruct the Assembler service to continue processing a job when an error occurs, invoke the AssemblerOptionSpec object’s setFailOnError method and pass false.
- Retrieve information about the PDF document.
  Invoke the AssemblerServiceClient object’s invokeDDX method and pass the following required values:
  - A com.adobe.idp.Document object that represents the DDX document to use
  - A java.util.Map object that contains the input PDF file that is used to determine PDF/A compliancy
  - A com.adobe.livecycle.assembler.client.AssemblerOptionSpec object that specifies the run-time options
  The invokeDDX method returns a com.adobe.livecycle.assembler.client.AssemblerResult object that contains XML data that specifies whether the input PDF document is PDF/A-compliant.
- Save the returned XML document.
  To obtain XML data that specifies whether the input PDF document is a PDF/A document, perform the following actions:
  - Invoke the AssemblerResult object’s getDocuments method. This returns a java.util.Map object.
  - Iterate through the java.util.Map object until you find the resultant com.adobe.idp.Document object.
  - Invoke the com.adobe.idp.Document object’s copyToFile method to extract the XML document. Ensure that you save the XML data as an XML file.
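The rule that the input map key must match the source value in the DDX document can be sketched with the JDK alone. In this sketch the class name InputMapBuilder and its methods are hypothetical, and the map value is typed as Object as a stand-in; in a real client it would be the com.adobe.idp.Document object that holds the PDF. Reading the key from the DDX itself avoids a mismatch between the two files.

```java
import java.io.StringReader;
import java.util.HashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper for illustration; not part of the AEM Forms API.
final class InputMapBuilder {

    // Reads the source attribute of the first DocumentInformation element.
    static String sourceKey(String ddxXml) {
        Document doc;
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            factory.setNamespaceAware(true);
            doc = factory.newDocumentBuilder()
                    .parse(new InputSource(new StringReader(ddxXml)));
        } catch (Exception e) {
            throw new IllegalArgumentException("DDX is not well-formed XML", e);
        }
        NodeList infos = doc.getElementsByTagNameNS("*", "DocumentInformation");
        if (infos.getLength() == 0) {
            throw new IllegalArgumentException("No DocumentInformation element found");
        }
        String source = ((Element) infos.item(0)).getAttribute("source");
        if (source.isEmpty()) {
            throw new IllegalArgumentException("DocumentInformation has no source attribute");
        }
        return source;
    }

    // Builds the input map keyed by the DDX source value.
    // pdfDocument is a placeholder for the object that contains the input PDF.
    static Map<String, Object> buildInputMap(String ddxXml, Object pdfDocument) {
        Map<String, Object> inputs = new HashMap<>();
        inputs.put(sourceKey(ddxXml), pdfDocument);
        return inputs;
    }
}
```

For the DDX document introduced in this section, sourceKey would return Loan.pdf.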
See also
Quick Start (SOAP mode): Determining whether a document is PDF/A compliant using the Java API (SOAP mode)
Determine whether a document is PDF/A compliant using the web service API
Determine whether a PDF document is PDF/A-compliant by using the Assembler Service API (web service):
- Include project files.
  Create a Microsoft .NET project that uses MTOM. Ensure that you use the following WSDL definition:
  http://localhost:8080/soap/services/AssemblerService?WSDL&lc_version=9.0.1
  NOTE: Replace localhost with the IP address of the server hosting AEM Forms.
- Create a PDF Assembler client.
  - Create an AssemblerServiceClient object by using its default constructor.
  - Create an AssemblerServiceClient.Endpoint.Address object by using the System.ServiceModel.EndpointAddress constructor. Pass a string value that specifies the WSDL to the AEM Forms service (for example, http://localhost:8080/soap/services/AssemblerService?blob=mtom). You do not need to use the lc_version attribute; it is used only when you create a service reference.
  - Create a System.ServiceModel.BasicHttpBinding object by getting the value of the AssemblerServiceClient.Endpoint.Binding field. Cast the return value to BasicHttpBinding.
  - Set the System.ServiceModel.BasicHttpBinding object’s MessageEncoding field to WSMessageEncoding.Mtom. This value ensures that MTOM is used.
  - Enable basic HTTP authentication by performing the following tasks:
    - Assign the AEM Forms user name to the field AssemblerServiceClient.ClientCredentials.UserName.UserName.
    - Assign the corresponding password value to the field AssemblerServiceClient.ClientCredentials.UserName.Password.
    - Assign the constant value HttpClientCredentialType.Basic to the field BasicHttpBindingSecurity.Transport.ClientCredentialType.
    - Assign the constant value BasicHttpSecurityMode.TransportCredentialOnly to the field BasicHttpBindingSecurity.Security.Mode.
- Reference an existing DDX document.
  - Create a BLOB object by using its constructor. The BLOB object is used to store the DDX document.
  - Create a System.IO.FileStream object by invoking its constructor and passing a string value that represents the file location of the DDX document and the mode in which to open the file.
  - Create a byte array that stores the content of the System.IO.FileStream object. You can determine the size of the byte array by getting the System.IO.FileStream object’s Length property.
  - Populate the byte array with stream data by invoking the System.IO.FileStream object’s Read method and passing the byte array, the starting position, and the stream length to read.
  - Populate the BLOB object by assigning its MTOM field with the contents of the byte array.
- Reference a PDF document used to determine PDF/A compliancy.
  - Create a BLOB object by using its constructor. The BLOB object is used to store the input PDF document.
  - Create a System.IO.FileStream object by invoking its constructor and passing a string value that represents the file location of the input PDF document and the mode in which to open the file.
  - Create a byte array that stores the content of the System.IO.FileStream object. You can determine the size of the byte array by getting the System.IO.FileStream object’s Length property.
  - Populate the byte array with stream data by invoking the System.IO.FileStream object’s Read method and passing the byte array, the starting position, and the stream length to read.
  - Populate the BLOB object by assigning its MTOM field with the contents of the byte array.
  - Create a MyMapOf_xsd_string_To_xsd_anyType object. This collection object is used to store the PDF document.
  - Create a MyMapOf_xsd_string_To_xsd_anyType_Item object.
  - Assign a string value that represents the key name to the MyMapOf_xsd_string_To_xsd_anyType_Item object’s key field. This value must match the value of the source attribute specified in the DDX document.
  - Assign the BLOB object that stores the PDF document to the MyMapOf_xsd_string_To_xsd_anyType_Item object’s value field.
  - Add the MyMapOf_xsd_string_To_xsd_anyType_Item object to the MyMapOf_xsd_string_To_xsd_anyType object. Invoke the MyMapOf_xsd_string_To_xsd_anyType object’s Add method and pass the MyMapOf_xsd_string_To_xsd_anyType_Item object.
- Set run-time options.
  - Create an AssemblerOptionSpec object that stores run-time options by using its constructor.
  - Set run-time options to meet your business requirements by assigning a value to a data member that belongs to the AssemblerOptionSpec object. For example, to instruct the Assembler service to continue processing a job when an error occurs, assign false to the AssemblerOptionSpec object’s failOnError data member.
- Retrieve information about the PDF document.
  Invoke the AssemblerServiceClient object’s invokeDDX method and pass the following values:
  - A BLOB object that represents the DDX document.
  - The MyMapOf_xsd_string_To_xsd_anyType object that contains the input PDF document. Its keys must match the names of the PDF source files, and its values must be the BLOB object that corresponds to the input PDF file.
  - An AssemblerOptionSpec object that specifies run-time options.
  The invokeDDX method returns an AssemblerResult object that contains XML data that specifies whether the input PDF document is a PDF/A document.
- Save the returned XML document.
  To obtain XML data that specifies whether the input PDF document is a PDF/A document, perform the following actions:
  - Access the AssemblerResult object’s documents field, which is a Map object that contains the XML data that specifies whether the input PDF document is a PDF/A document.
  - Iterate through the Map object to obtain each resultant document. Then, cast that array member’s value to a BLOB.
  - Extract the binary data that represents the XML data by accessing the BLOB object’s MTOM field. This field stores an array of bytes that you can write out as an XML file.
See also