How can I search file1.txt and file2.txt for matching characters and print output to a new file?

I've run into a problem! I just don't have enough knowledge to solve this on my own, so I would be very grateful if someone could help me.

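I have two text files, file1.txt and file2.txt. They have a similar format but are not identical: the names are on different line numbers, and the files have different numbers of lines. Because of the amount of data in the files, doing this by hand is not feasible.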

Example of the file format:

file1.txt

NAME:FLAT
Jerome:Flat 6
Jimmy:Flat 4

file2.txt

0:NAME:JOB:MONEY:FLAT
1:Bob:Developer:0:Flat 7
2:Jerome:Gardener::Flat 6
3:Cindy:Graphics:0:Flat 5
4:Jimmy:Mod:0:Flat 4

I am trying to search file1.txt to see which names match the names in file2.txt, and print the entire matching line from file2.txt to a new text document.

Here is an example of what I want to do:

Checks matching NAME in file1.txt and file2.txt
Ignores "1:Bob:Developer:0:Flat 7" because Bob only exists in file2.txt
Pastes "2:Jerome:Gardener::Flat 6" into file3.txt because Jerome exists in file1.txt and file2.txt
Ignores "3:Cindy:Graphics:0:Flat 5" because Cindy only exists in file2.txt
Pastes "4:Jimmy:Mod:0:Flat 4" into file3.txt because Jimmy exists in file1.txt and file2.txt

What file3.txt should look like:

2:Jerome:Gardener::Flat 6
4:Jimmy:Mod:0:Flat 4

Thanks for reading! It would be great if someone could let me know whether this is possible.

Edit: what I currently have:

awk -F ":" 'FNR==NR{a[];next}( in a){print}' file2.txt file1.txt > file3.txt

Using some GNU tools:

join -t ":" -1 1 -2 2 <(sed 1d file1.txt | sort) <(sort -t ":" -k 2,2 file2.txt) -o 2.1,2.2,2.3,2.4,2.5

Output:

2:Jerome:Gardener::Flat 6
4:Jimmy:Mod:0:Flat 4

See: info join and man sort
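
The join command above relies on process substitution, `<(...)`, which bash, ksh and zsh provide but plain sh does not. As a rough sketch of the same idea with intermediate files instead, the following recreates the sample data from the question in a scratch directory. The scratch directory, the file names names.sorted and file2.sorted, and the LC_ALL=C setting are my own assumptions for illustration, not part of the answer above.

```shell
#!/bin/sh
# Use one fixed collation so sort and join agree on ordering (assumption for this demo).
LC_ALL=C
export LC_ALL

# Work in a scratch directory so no existing files are overwritten.
workdir=$(mktemp -d)
cd "$workdir" || exit 1

# Sample data copied from the question.
cat > file1.txt <<'EOF'
NAME:FLAT
Jerome:Flat 6
Jimmy:Flat 4
EOF

cat > file2.txt <<'EOF'
0:NAME:JOB:MONEY:FLAT
1:Bob:Developer:0:Flat 7
2:Jerome:Gardener::Flat 6
3:Cindy:Graphics:0:Flat 5
4:Jimmy:Mod:0:Flat 4
EOF

# Drop the header of file1.txt and sort by name (field 1).
sed 1d file1.txt | sort -t ':' -k 1,1 > names.sorted

# Sort file2.txt by name (field 2); join needs both inputs sorted on the join field.
sort -t ':' -k 2,2 file2.txt > file2.sorted

# Join on file1 field 1 = file2 field 2 and print only the file2 fields.
join -t ':' -1 1 -2 2 -o 2.1,2.2,2.3,2.4,2.5 names.sorted file2.sorted > file3.txt

cat file3.txt
```

Note that join emits lines in the sorted order of the names, which is not necessarily the original order of file2.txt; with this sample data the two happen to coincide.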

Based on the samples you have shown, please try the following. Written and tested in GNU awk.

awk '
BEGIN  { FS=":" }
FNR==1 { next   }
FNR==NR{
  arr[$1]
  next
}
($2 in arr)
' file1.txt file2.txt

Explanation: adding a detailed explanation of the above.

awk '                    ##Starting awk program from here.
BEGIN  { FS=":" }        ##Starting BEGIN section from here and setting FS as : here.
FNR==1 { next   }        ##Skipping the first (header) line of each Input_file.
FNR==NR{                 ##This condition will be TRUE when file1.txt is being read.
  arr[$1]                ##Creating array with $1 (the name) as key here.
  next                   ##next will skip all further statements from here.
}
($2 in arr)              ##Checking condition if 2nd field is in arr then print line from file2.txt
' file1.txt file2.txt    ##Mentioning Input_file names here.
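
To try this end to end, here is a minimal sketch that recreates the sample files from the question and redirects the result to file3.txt, as the question asks. The scratch directory created with mktemp is my own assumption for the demo, and the awk program is the one above written on a single line with -F: in place of the BEGIN block.

```shell
#!/bin/sh
# Work in a scratch directory so no existing files are overwritten (assumption for this demo).
workdir=$(mktemp -d)
cd "$workdir" || exit 1

# Sample data copied from the question.
cat > file1.txt <<'EOF'
NAME:FLAT
Jerome:Flat 6
Jimmy:Flat 4
EOF

cat > file2.txt <<'EOF'
0:NAME:JOB:MONEY:FLAT
1:Bob:Developer:0:Flat 7
2:Jerome:Gardener::Flat 6
3:Cindy:Graphics:0:Flat 5
4:Jimmy:Mod:0:Flat 4
EOF

# First pass (file1.txt): remember each name from field 1.
# Second pass (file2.txt): print lines whose field 2 is a remembered name.
# The header line of each file is skipped.
awk -F: 'FNR==1{next} FNR==NR{arr[$1]; next} ($2 in arr)' file1.txt file2.txt > file3.txt

cat file3.txt
```

Unlike the join approach, this keeps the matching lines in the order they appear in file2.txt and needs no sorting.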