在多个连接上删除性能与 select?

Delete performance vs select on multiple joins?

我正在尝试执行大量删除。

我认为使用连接而不是子查询可以提高性能。

我想到了这个查询:

delete t1
    from table1 t1
    join table2 t2  on t1.a = t2.a
    join table3 t3  on t2.b = t3.b;

它需要很长时间,即使没有行被删除,尽管 select 等价物是即时的:

select *
    from table1 t1
    join table2 t2 on t1.a = t2.a
    join table3 t3 on t2.b = t3.b;

这是为什么?我怎样才能使我的第一个查询更快?

编辑:执行计划

mysql> explain delete t1 from table1 t1 join table2 t2 on t1.a = t2.a join table3 t3 on t2.b = t3.b;
+----+-------------+-------+------------+-------+--------------------------+----------+---------+----------+------+----------+-------------+
| id | select_type | table | partitions | type  | possible_keys            | key      | key_len | ref      | rows | filtered | Extra       |
+----+-------------+-------+------------+-------+--------------------------+----------+---------+----------+------+----------+-------------+
|  1 | SIMPLE      | t2    | NULL       | index | PRIMARY                  | b        | 257     | NULL     |    1 |   100.00 | Using index |
|  1 | DELETE      | t1    | NULL       | ref   | a,FK2354764DB4B32        | a        | 8       | db.t2.a  |    1 |   100.00 | NULL        |
|  1 | SIMPLE      | t3    | NULL       | ALL   | NULL                     | NULL     | NULL    | NULL     | 5000 |    10.00 | Using where |
+----+-------------+-------+------------+-------+--------------------------+----------+---------+----------+------+----------+-------------+

edit2:存在 select 的另一个尝试

mysql> explain delete from table1 t1 where exists (select 1 from table2 t2 where t2.a = t1.a and exists (select 1 from table3 t3 where t3.b = t2.b));
+----+--------------------+-------+------------+--------+---------------+---------+---------+---------------+------+----------+-------------------------------------------------------------------+
| id | select_type        | table | partitions | type   | possible_keys | key     | key_len | ref           | rows | filtered | Extra                                                             |
+----+--------------------+-------+------------+--------+---------------+---------+---------+---------------+------+----------+-------------------------------------------------------------------+
|  1 | DELETE             | t1    | NULL       | ALL    | NULL          | NULL    | NULL    | NULL          | 10000|   100.00 | Using where                                                       |
|  2 | DEPENDENT SUBQUERY | t2    | NULL       | eq_ref | PRIMARY       | PRIMARY | 8       | db.t1.a       |    1 |   100.00 | NULL                                                              |
|  2 | DEPENDENT SUBQUERY | t3    | NULL       | ALL    | NULL          | NULL    | NULL    | NULL          | 5000 |    10.00 | Using where; FirstMatch(t2); Using join buffer (Block Nested Loop)|
+----+--------------------+-------+------------+--------+---------------+---------+---------+---------------+------+----------+-------------------------------------------------------------------+
3 rows in set, 3 warnings (0.00 sec)

谢谢

delete from table1 t1 
WHERE exists (
  select 1 from table2 t2 
  where t1.a = t2.a 
    and exists ( select 1 from table3 t3 where t2.b = t3.b)
;

并确保 t1.a, t2.a, t2.b, t3.b 已编入索引。

此外,如果您要删除 table 上的大部分数据,我建议您导出那些不会被删除的记录并截断源 table。之后就可以导入导出的记录了。

delete from t1
 where t1.a in (select distinct t2.a from t2 inner join t3 on t2.b = t3.b)

如果您要删除 table 中的大量行,将要保留的行移动到另一个 table 中通常会更快,然后截断并重新加载原始行table:

-- select the rows we want to keep into a new table
create table tmptable as 
select *
from table1 t1
where not exists (
    select 1
    from table2 t2
    inner join table3 t3 on t3.b = t2.b
    where t2.a = t1.a
);

-- empty the original table
truncate table table1;  -- !! back it up first !!

-- reload it
insert into table1 select * from tmptable;

-- done
drop table tmptable;