如何在 java 中签署包含特殊字符的 XML 文档?

How to sign a XML document that contains special characters, in java?

我正在签署一份包含特殊字符(非 ASCII 字符,如 ã、á、à、ç 等)的 XML 文档,在 java class 中由 PeopleSoft 系统调用。为了避免任何问题,我删除了这些字符,但实际上我需要用它们打印这份签名文档。有什么办法吗?这是待签名的XML文档样本(已准备好签名):

<PedidoEnvioLoteRPS xmlns="http://www.prefeitura.sp.gov.br/nfe" xmlns:xsd="http://www.w3.org/2001/XMLSchema" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
  <Cabecalho Versao="1" xmlns="">
    <CPFCNPJRemetente>
      <CNPJ>99999999999999</CNPJ>
    </CPFCNPJRemetente>
    <transacao>false</transacao>
    <dtInicio>2018-10-02</dtInicio>
    <dtFim>2018-10-02</dtFim>
    <QtdRPS>1</QtdRPS>
    <ValorTotalServicos>40</ValorTotalServicos>
    <ValorTotalDeducoes>0</ValorTotalDeducoes>
  </Cabecalho>
  <RPS xmlns="">
    <Assinatura>FavH23VVIbPWzlvJ28OZZ26Lv2aEgWfmsdhPN1qQN19UCxv6xzu8fHC50wnji3i3G49DuYoXy354U2IxzooPtZYWv7KFUwWLWC4xJYpNKNLOg3txx4znxDNbdC9l/ot9liIMKHf/8rJdciGMpwUOMxt3z95sFVJDcvx/3si1yQG0TaQsWLLLKHH4rUwfE+OWYBIwp/CWBf1/IRzYsFb/q2UgpvfvU1RaXIgI+aNqwYyKulhfUZItI4nYJTsGcXG0y+iXxW3oRWiCGJ5leOysHyJ4VLJcg/vehwT8f8ZQLhvClKeDQUQpL9ts+9oX4PHdc8WXDgN5ekUmvCHS/GW0ew==</Assinatura>
    <ChaveRPS>
      <InscricaoPrestador>99999999</InscricaoPrestador>
      <SerieRPS>1</SerieRPS>
      <NumeroRPS>180</NumeroRPS>
    </ChaveRPS>
    <TipoRPS>RPS</TipoRPS>
    <DataEmissao>2018-10-02</DataEmissao>
    <StatusRPS>N</StatusRPS>
    <TributacaoRPS>T</TributacaoRPS>
    <ValorServicos>40</ValorServicos>
    <ValorDeducoes>0</ValorDeducoes>
    <CodigoServico>3205</CodigoServico>
    <AliquotaServicos>2</AliquotaServicos>
    <ISSRetido>false</ISSRetido>
    <CPFCNPJTomador>
      <CNPJ>88888888888888</CNPJ>
    </CPFCNPJTomador>
    <RazaoSocialTomador>XPTO S.A.</RazaoSocialTomador>
    <EnderecoTomador>
      <Logradouro>Av do Lago</Logradouro>
      <NumeroEndereco>999</NumeroEndereco>
      <ComplementoEndereco>9 andar - cj. 99</ComplementoEndereco>
      <Bairro>Vila Guilherme</Bairro>
      <Cidade>3505708</Cidade>
      <UF>SP</UF>
      <CEP>99999999</CEP>
    </EnderecoTomador>
    <EmailTomador>teste@teste.com.br</EmailTomador>
    <Discriminacao>Tarifa de antecipação de entrega VR Saúde Familiar: R$ 40,00||||||||||||||||IRRF 1,5% Sob Responsabilidade de VR Benefícios Serv  Proc Ltda conforme I.N. 153/87 e |Lei 7450/85, art. 53 - R$ 0,60|Trib aprox. Lei nº 12.741/12: R,38 Federal, R,68 Municipal e R,94 pelos serviços|Fonte:IBPT/empresometro.com.br  A3S28F 18.2.B|Contrato XPTO|Autorização de Regime especial - SEI 6017.2018/0055420-5 (32600,94)|REALIZE O PAGAMENTO APENAS DE BOLETOS EMITIDOS POR VOCÊ NA ÁREA LOGADA E SEGURA DO SEU|PORTAL RH. PREVINA-SE E EVITE PREJUÍZOS FINANCEIROS.</Discriminacao>
    <ValorCargaTributaria>7.06</ValorCargaTributaria>
    <PercentualCargaTributaria>17.64</PercentualCargaTributaria>
    <FonteCargaTributaria>IBPT</FonteCargaTributaria>
  </RPS>

</PedidoEnvioLoteRPS>

我用来签名的java方法是:

public void AssinaXML(String ArqAssinar) {

    try {
        /* Creates the DOM document DOM from the file in ArqAssinar */
        DocumentBuilderFactory DocBuilderFactory = DocumentBuilderFactory.newInstance();
        DocBuilderFactory.setNamespaceAware(true);
        DocumentBuilder DocBuilder = DocBuilderFactory.newDocumentBuilder();
        FileInputStream Input = new FileInputStream(ArqAssinar);
        Document Doc = DocBuilder.parse(Input);

        /* Gets the position of the Signature tag */
        Node Tag = Doc.getDocumentElement();

        if (Tag != null) {
            /* Signs the document */
            DOMSignContext DocSignCont = new DOMSignContext(PrivPass, Tag);
            XMLSignature Signature = XmlSignFac.newXMLSignature(SignInfo, KeyInf);
            Signature.sign(DocSignCont);

            /* Creates the Signature tag with the results */
            OutputStream Saida = new FileOutputStream(ArqAssinar);
            TransformerFactory TransformFac = TransformerFactory.newInstance();
            Transformer Transf = TransformFac.newTransformer();
            Transf.transform(new DOMSource(Doc), new StreamResult(Saida));  
        }
        else {
            System.out.println("Java Assinatura_Digital, método AssinaXML - A tag especificada para inserir a assinatura não foi encontrada");
        }
    }
    catch (Exception E) {
        E.PrintStackTrace();
    }
}

但是当我尝试签署上面的 XML 文档时出现此错误:

com.sun.org.apache.xerces.internal.impl.io.MalformedByteSequenceException: Invalid byte 2 of 3-byte UTF-8 sequence. at com.sun.org.apache.xerces.internal.impl.io.UTF8Reader.invalidByte(Unknown Source) at com.sun.org.apache.xerces.internal.impl.io.UTF8Reader.read(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLEntityScanner.load(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLEntityScanner.skipChar(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl$FragmentContentDriver.next(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLDocumentScannerImpl.next(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLNSDocumentScannerImpl.next(Unknown Source) at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.XMLParser.parse(Unknown Source) at com.sun.org.apache.xerces.internal.parsers.DOMParser.parse(Unknown Source) at com.sun.org.apache.xerces.internal.jaxp.DocumentBuilderImpl.parse(Unknown Source) at javax.xml.parsers.DocumentBuilder.parse(Unknown Source) at GVR_Assinatura_Digital.AssinaXML(GVR_Assinatura_Digital.java:551) at GVR_Assinatura_Digital.main(GVR_Assinatura_Digital.java:778)

有人知道为什么会这样吗?

正如评论者所说,您可以签署任何有效的 XML 文档。

所以真正的问题应该是,如何创建包含 ã?

等字符的有效 XML 文档

答案:

  • 默认情况下,XML 文档以 UTF-8 编码
  • 因此,您可以通过 Unicode 字符序列的 UTF-8 编码直接将任何字符添加到 XML 文档中。以下是有效的 XML 文档:

UTF-8 编码

<test>Ã¥</test>
  • 您还可以将字符作为字符实体包含在内。但是,一旦您将字符设置为 UTF-8,就无需将它们转换为字符实体。

如果您在创建包含非 ASCII 字符的有效 XML 文档时遇到问题,请创建一个新的 Stack Overflow 问题,其中包含正在(或应该)创建 XML 文件。

另见:

Microsoft article on encoding characters in XML

我得到了解决方案:我刚刚交易了这个:

FileInputStream Input = new FileInputStream(ArqAssinar);
Document Doc = DocBuilder.parse(Input);

为此:

InputStream Input = new FileInputStream(ArqAssinar);
Reader Leitor = new InputStreamReader(Input, "UTF-8");
InputSource Origem = new InputSource(Leitor);
Document Doc = DocBuilder.parse(Origem);