How to extract some of the fields with Nokogiri

My text file looks like this:

<first>1</first><Name>wangli</Name><birthday>19860105</birthday><address>Here</address>
<first>2</first><Name>zhangli</Name><birthday>19870105</birthday><address>Sangdu</address>
<first>3</first><Name>lili</Name><birthday>19880105</birthday><address>Hongkong</address>
<first>4</first><Name>liuli</Name><birthday>19860515</birthday><address>London</address>

I want to use the Ruby gem Nokogiri to create a new file that looks like this:

wangli-Here
zhangli-Sangdu
lili-Hongkong
liuli-London

I tried:

require 'nokogiri'
doc = Nokogiri::XML(File.open("file"),nil,"gbk")
puts doc.xpath("/name") + doc.xpath("/address")

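It doesn't work.

It looks like the problem is solved! I changed the text file so that every record is wrapped in a line element and the whole file has a single root element: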

<doc>
<line><first>1</first><Name>wangli</Name><birthday>19860105</birthday><address>Here</address></line>
<line><first>2</first><Name>zhangli</Name><birthday>19870105</birthday><address>Sangdu</address></line>
<line><first>3</first><Name>lili</Name><birthday>19880105</birthday><address>Hongkong</address></line>
<line><first>4</first><Name>liuli</Name><birthday>19860515</birthday><address>London</address></line>
</doc>

Then the Ruby code:

require 'nokogiri'
doc = Nokogiri::XML(File.open("27065"), nil, "gbk")
doc.xpath("//line").each do |line|
    # Element names are case-sensitive, so the path must say "Name", not "name".
    puts line.xpath("./Name").text + "-" + line.xpath("./address").text
end

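Because each line of your input contains an XML fragment rather than a complete document with a single root element, you have to process the file line by line. You also need to use Nokogiri::XML.fragment to parse each line. Here is a working example: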

require "nokogiri"

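# The block form of File.open closes both files automatically.
File.open("output.txt", "w") do |output|
    File.open("input.xml", "r") do |f|
        f.each_line do |line|
            # A line has several top-level elements, so parse it as a fragment.
            frag = Nokogiri::XML.fragment(line)
            # Join the two fields with "-" to match the requested output.
            output.puts "#{frag.search('Name').text}-#{frag.search('address').text}"
        end
    end
end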