将选定行从不同目录中的文件复制到另一个文件

Question

我有一个包含许多子目录的目录，其中包含文件。我想在 "App_integrations" 文件夹中打开以 "root.vrpj" 或 "root.vprj" 结尾的文件，并将包含单词 "table" 的行复制到另一个文件。

到目前为止，我已经成功地使用以下代码访问了每个文件：

for root, dirs, files in os.walk(movedir):
for filename in files:
    if filename.endswith(("root.vrpj", "root.vprj")):

问题是我现在只有我想访问的文件的名称，我被困在这里了。

Answer 1

你可以试试这个：

f = open('final_file.txt', 'w')
for root, dirs, files in os.walk(movedir):
   for filename in files:
      if filename.endswith("root.vrpj") or  filename.endswith("root.vprj"):
         with open(filename) as data:
            for line in data:
               if "table" in data:
                   f.write('{}\n'.format(data))
f.close()

Answer 2

这是 Ajax 代码的一个版本，可关闭您在循环中打开的文件（并修复了其他几个小问题）：

with open('final_file.txt', 'w') as f:
    for root, dirs, files in os.walk(movedir):
        for filename in files:
            if filename.endswith(("root.vrpj"), ("root.vprj")):
                with open(os.path.join(root, filename)) as finput:
                     for line in finput:
                         if 'table' in line:
                             f.write(line)

然而，当您看到 8 级缩进时，您需要重构，例如：

def find_files(startdir, *extensions):
    for root, dirs, files in os.walk(movedir):
        for filename in files:
            if filename.endswith(extensions):
                yield os.path.join(root, filename)

def find_lines(fname, text):
    with open(fname) as fp:
         return [line for line in fp if text in line]

with open('final_file.txt', 'w') as f:
    for fname in find_files(movedir, 'root.vrpj', 'root.vprj'):
        f.writelines(find_lines(fname, 'table'))

Answer 3

我终于解决了

    import os

rootdir = my root folder

# creates a file f that contains all the lines of the files 
# with "root.vrpj" or "root.vprj" in their name
# and who are inside "App_integrations" folders
# without duplicates

#creating the big file with all the file containing the lines I need
f = open('final_file.txt', 'a')
for root, dirs, files in os.walk(rootdir):  
    for filename in files:
        if (filename.endswith(("root.vrpj", "root.vprj")) and ("App_Integration" in os.path.join(root, filename))):
            full_name = os.path.join(root, filename) 
            data = open(full_name).read()
            f.write(data + "\n")                 
f.close()

#copying the lines I need to f1 without duplicates
lines_seen = set()
f = open('final_file.txt')
f1 = open('testread1.txt', 'a')
doIHaveToCopyTheLine=False
for line in f.readlines():
    if (("Table" in line) and (line not in lines_seen)):
        doIHaveToCopyTheLine=True
        if doIHaveToCopyTheLine:
            f1.write(line)
            lines_seen.add(line)
f1.close()
f.close()

Answer 4

查找文件

from pathlib import Path
import itertools

source_dir = Path(<source_dir>)

patterns = ['**/*root.vrpj', '**/*root.vprj']

files = itertools.chain.from_iterables(source_dir.glob(pat) for pat in patterns))

过滤文件：

def filter_lines(files):
    for file in files:
        if not 'App_Integration' in file.parts:
            continue
        with file.open('r') as file_handle:
            for line in file_handle:
                if 'table' in line:
                    yield line

写入输出

def save_lines(lines, output_file=sys.std_out):
    for line in lines:
        output_file.write(line)

with Path(<output_file>).open('w') as output_file:
    save_lines(filter_lines(files), as output_file)

将选定行从不同目录中的文件复制到另一个文件

Copying selected lines from files in different directories to another file

python

copy

file-copying

python-2.7

查找文件

过滤文件：

写入输出