比较图像是否相等但输出很大

Question

我一直在尝试以 ASCII 值读取图像并比较两幅图像以获得精确的 ASCII 值匹配。但是，输出非常大，我的硬件很旧，无法读取输出，我尝试将输出保存到文件中，但文件很大。这是我正在尝试做的事情：

orig = sys.stdout
f = open('output.txt','w')
sys.stdout = f

# Load the two Images 

with open("image1.jpg", "rb") as b:
 with open("image2.jpg", "rb") as a:

  # Convert the two images from binary to ascii

    chunk1 = binascii.b2a_hex(b.read())
    chunk2 = binascii.b2a_hex(a.read())

# split the two chunks of ascii values into a list of 24 bytes 

chunkSize = 24
for i in range (0,len(chunk1),chunkSize):
 for j in range (0,len(chunk2),chunkSize):

 # Print them

  list1 = chunk1[i:i+chunkSize]
  print "List1: "+ list1
  list2 = chunk2[j:j+chunkSize]
  print "List2: " + list2

# Compare the two images for equality 

  list = list1 == list2

 # print whether its a match or false

  print list

sys.stdout = orig
f.close()

# Saved to a file

工作原理：

img1 有以下十六进制：FFD8 FFE0 0010 4A46 4946 0001 0200 0064 0064 0000 FFEC 0011 img2 具有以下十六进制：FFD8 FFE0 0010 4A46 4946 0001 0210 0064 0064 0000 FFEC 0012

它会使用 img1 的前 24 个字符并一次测试 24 个字符中的所有 img2 十六进制，然后使用 img1 的下一个 24 个字符并测试所有 img2 十六进制。示例：

List1: FFD8 FFE0 0010 4A46 4946 0001 
List2: FFD8 FFE0 0010 4A46 4946 0001 
True 

List1: FFD8 FFE0 0010 4A46 4946 0001 
List2: 0210 0064 0064 0000 FFEC 0012 
False

List1: 0200 0064 0064 0000 FFEC 0011 
List2: FFD8 FFE0 0010 4A46 4946 0001 
False 

List1: 0200 0064 0064 0000 FFEC 0011 
List2: 0210 0064 0064 0000 FFEC 0012 
False

但是，考虑到像 40k 十六进制和 20k 这样的巨大图像，我无法从终端读取或者将输出保存到文件中，因此输出很大。

如何只打印匹配的（真）24 个字符 ASCII 十六进制值而不打印真、假和假 ASCII 十六进制值？

FFD8 FFE0 0010 4A46 4946 0001

Answer 1

您可以一次从每个图像中读取 24 个字节，而不是一次读取整个文件。 file.read() accepts a parameter that allows it to just read a couple of bytes at a time. You can run this in a loop until read() returns an empty string which means that the end of file has been reached. See the doc.

编辑:

如果您只想检查两个文件是否相同，为什么不查看校验和呢？相同的文件将始终具有相同的校验和。请参阅此 answer 了解更多详细信息。

Answer 2

如果我理解了问题，怎么样：

orig = sys.stdout
f = open('output.txt','w')
sys.stdout = f

# Load the two Images 

with open("image1.jpg", "rb") as b:
 with open("image2.jpg", "rb") as a:

  # Convert the two images from binary to ascii

    chunk1 = binascii.b2a_hex(b.read())
    chunk2 = binascii.b2a_hex(a.read())

# split the two chunks of ascii values into a list of 24 bytes 

chunkSize = 24
for i in range (0,len(chunk1),chunkSize):
 for j in range (0,len(chunk2),chunkSize):

  list1 = chunk1[i:i+chunkSize]
  list2 = chunk2[j:j+chunkSize]

  # Compare the two images for equality 

  list = list1 == list2

  # print bytes once only if they were the same in both list1 and list2

  if list:
   print list1

sys.stdout = orig
f.close()

这将忽略原始示例中为 False 的任何输出，唯一的输出将是匹配的字节。如果这不是您的意思，您能否明确说明您想要实现的目标？

比较图像是否相等但输出很大

Comparing Images for equality but output is large

python

printing

equality