如何在 file1.txt 和 file2.txt 中搜索匹配字符并将输出打印到新文件
How can I search file1.txt and file2.txt for matching characters and print output to a new file
我 运行 遇到了问题!我只是没有足够的知识来独自解决这个问题,所以如果有人能够帮助我,我将不胜感激。
我有两个文本文件:file1.txt
和 file2.txt
;它们具有相似的格式但不完全相同。名称位于不同的行号上,文件具有不同的行数。由于文件中的数据量,手动执行此操作不可行。
文件格式示例:
file1.txt
NAME:FLAT
Jerome:Flat 6
Jimmy:Flat 4
file2.txt
0:NAME:JOB:MONEY:FLAT
1:Bob:Developer:0:Flat 7
2:Jerome:Gardener::Flat 6
3:Cindy:Graphics:0:Flat 5
4:Jimmy:Mod:0:Flat 4
我正在尝试搜索 file1.txt 以查看哪个名称与文件 2 的名称匹配,并将 file2.txt 的整行打印到新的文本文档中。
这是我想做的一个例子:
Checks matching NAME in file1.txt and file2.txt
Ignores "1:Bob:Developer:0:Flat 7" because Bob only exists in file2.txt
Pastes "2:Jerome:Gardener::Flat 6" into file3.txt because Jerome exists in file1.txt and file2.txt
Ignores "3:Cindy:Graphics:0:Flat 5" because Cindy only exists in file2.txt
Pastes "4:Jimmy:Mod:0:Flat 4" into file3.txt because Jimmy exists in file1.txt and file2.txt
file3 的外观
File3.txt
2:Jerome:Gardener::Flat 6
4:Jimmy:Mod:0:Flat 4
感谢阅读!如果有人能让我知道这是否可行,那就太好了。
编辑:
我目前拥有的
awk -F ":" 'FNR==NR{a[];next}( in a){print}' file2.txt file1.txt > file3.txt
使用一些 GNU 工具:
join -t ":" -1 1 -2 2 <(sed 1d File1.txt | sort) <(sort -t ":" -k 2,2 File2.txt) -o 2.1,2.2,2.3,2.4,2.5
输出:
2:Jerome:Gardener::Flat 6
4:Jimmy:Mod:0:Flat 4
参见:info join
和 man sort
根据您展示的示例,请您尝试以下操作。使用 GNU awk
.
编写和测试
awk '
BEGIN { FS=":" }
FNR==1 { next }
FNR==NR{
arr[]
next
}
( in arr)
' file1.txt file2.txt
说明: 为以上添加详细说明。
awk ' ##Starting awk program from here.
BEGIN { FS=":" } ##Starting BEGIN section from here and setting FS as : here.
FNR==1 { next } ##Checking if this is first line in any of Input_file then simply go to next line.
FNR==NR{ ##This condition will be TRUE when file1.txt is being read.
arr[] ##Creating array with as key here.
next ##next will skip all further statements from here.
}
( in arr) ##Checking condition if 2nd fueld is in arr then print line from file2.txt
' file1.txt file2.txt ##Mentioning Input_file names here.
我 运行 遇到了问题!我只是没有足够的知识来独自解决这个问题,所以如果有人能够帮助我,我将不胜感激。
我有两个文本文件:file1.txt
和 file2.txt
;它们具有相似的格式但不完全相同。名称位于不同的行号上,文件具有不同的行数。由于文件中的数据量,手动执行此操作不可行。
文件格式示例:
file1.txt
NAME:FLAT
Jerome:Flat 6
Jimmy:Flat 4
file2.txt
0:NAME:JOB:MONEY:FLAT
1:Bob:Developer:0:Flat 7
2:Jerome:Gardener::Flat 6
3:Cindy:Graphics:0:Flat 5
4:Jimmy:Mod:0:Flat 4
我正在尝试搜索 file1.txt 以查看哪个名称与文件 2 的名称匹配,并将 file2.txt 的整行打印到新的文本文档中。
这是我想做的一个例子:
Checks matching NAME in file1.txt and file2.txt
Ignores "1:Bob:Developer:0:Flat 7" because Bob only exists in file2.txt
Pastes "2:Jerome:Gardener::Flat 6" into file3.txt because Jerome exists in file1.txt and file2.txt
Ignores "3:Cindy:Graphics:0:Flat 5" because Cindy only exists in file2.txt
Pastes "4:Jimmy:Mod:0:Flat 4" into file3.txt because Jimmy exists in file1.txt and file2.txt
file3 的外观 File3.txt
2:Jerome:Gardener::Flat 6
4:Jimmy:Mod:0:Flat 4
感谢阅读!如果有人能让我知道这是否可行,那就太好了。
编辑: 我目前拥有的
awk -F ":" 'FNR==NR{a[];next}( in a){print}' file2.txt file1.txt > file3.txt
使用一些 GNU 工具:
join -t ":" -1 1 -2 2 <(sed 1d File1.txt | sort) <(sort -t ":" -k 2,2 File2.txt) -o 2.1,2.2,2.3,2.4,2.5
输出:
2:Jerome:Gardener::Flat 6 4:Jimmy:Mod:0:Flat 4
参见:info join
和 man sort
根据您展示的示例,请您尝试以下操作。使用 GNU awk
.
awk '
BEGIN { FS=":" }
FNR==1 { next }
FNR==NR{
arr[]
next
}
( in arr)
' file1.txt file2.txt
说明: 为以上添加详细说明。
awk ' ##Starting awk program from here.
BEGIN { FS=":" } ##Starting BEGIN section from here and setting FS as : here.
FNR==1 { next } ##Checking if this is first line in any of Input_file then simply go to next line.
FNR==NR{ ##This condition will be TRUE when file1.txt is being read.
arr[] ##Creating array with as key here.
next ##next will skip all further statements from here.
}
( in arr) ##Checking condition if 2nd fueld is in arr then print line from file2.txt
' file1.txt file2.txt ##Mentioning Input_file names here.