在多个连接上删除性能与 select?
Delete performance vs select on multiple joins?
我正在尝试执行大量删除。
我认为使用连接而不是子查询可以提高性能。
我想到了这个查询:
delete t1
from table1 t1
join table2 t2 on t1.a = t2.a
join table3 t3 on t2.b = t3.b;
它需要很长时间,即使没有行被删除,尽管 select 等价物是即时的:
select *
from table1 t1
join table2 t2 on t1.a = t2.a
join table3 t3 on t2.b = t3.b;
这是为什么?我怎样才能使我的第一个查询更快?
编辑:执行计划
mysql> explain delete t1 from table1 t1 join table2 t2 on t1.a = t2.a join table3 t3 on t2.b = t3.b;
+----+-------------+-------+------------+-------+--------------------------+----------+---------+----------+------+----------+-------------+
| id | select_type | table | partitions | type | possible_keys | key | key_len | ref | rows | filtered | Extra |
+----+-------------+-------+------------+-------+--------------------------+----------+---------+----------+------+----------+-------------+
| 1 | SIMPLE | t2 | NULL | index | PRIMARY | b | 257 | NULL | 1 | 100.00 | Using index |
| 1 | DELETE | t1 | NULL | ref | a,FK2354764DB4B32 | a | 8 | db.t2.a | 1 | 100.00 | NULL |
| 1 | SIMPLE | t3 | NULL | ALL | NULL | NULL | NULL | NULL | 5000 | 10.00 | Using where |
+----+-------------+-------+------------+-------+--------------------------+----------+---------+----------+------+----------+-------------+
edit2:存在 select 的另一个尝试
mysql> explain delete from table1 t1 where exists (select 1 from table2 t2 where t2.a = t1.a and exists (select 1 from table3 t3 where t3.b = t2.b));
+----+--------------------+-------+------------+--------+---------------+---------+---------+---------------+------+----------+-------------------------------------------------------------------+
| id | select_type | table | partitions | type | possible_keys | key | key_len | ref | rows | filtered | Extra |
+----+--------------------+-------+------------+--------+---------------+---------+---------+---------------+------+----------+-------------------------------------------------------------------+
| 1 | DELETE | t1 | NULL | ALL | NULL | NULL | NULL | NULL | 10000| 100.00 | Using where |
| 2 | DEPENDENT SUBQUERY | t2 | NULL | eq_ref | PRIMARY | PRIMARY | 8 | db.t1.a | 1 | 100.00 | NULL |
| 2 | DEPENDENT SUBQUERY | t3 | NULL | ALL | NULL | NULL | NULL | NULL | 5000 | 10.00 | Using where; FirstMatch(t2); Using join buffer (Block Nested Loop)|
+----+--------------------+-------+------------+--------+---------------+---------+---------+---------------+------+----------+-------------------------------------------------------------------+
3 rows in set, 3 warnings (0.00 sec)
谢谢
delete from table1 t1
WHERE exists (
select 1 from table2 t2
where t1.a = t2.a
and exists ( select 1 from table3 t3 where t2.b = t3.b)
;
并确保 t1.a, t2.a, t2.b, t3.b
已编入索引。
此外,如果您要删除 table 上的大部分数据,我建议您导出那些不会被删除的记录并截断源 table。之后就可以导入导出的记录了。
delete from t1
where t1.a in (select distinct t2.a from t2 inner join t3 on t2.b = t3.b)
如果您要删除 table 中的大量行,将要保留的行移动到另一个 table 中通常会更快,然后截断并重新加载原始行table:
-- select the rows we want to keep into a new table
create table tmptable as
select *
from table1 t1
where not exists (
select 1
from table2 t2
inner join table3 t3 on t3.b = t2.b
where t2.a = t1.a
);
-- empty the original table
truncate table table1; -- !! back it up first !!
-- reload it
insert into table1 select * from tmptable;
-- done
drop table tmptable;
我正在尝试执行大量删除。
我认为使用连接而不是子查询可以提高性能。
我想到了这个查询:
delete t1
from table1 t1
join table2 t2 on t1.a = t2.a
join table3 t3 on t2.b = t3.b;
它需要很长时间,即使没有行被删除,尽管 select 等价物是即时的:
select *
from table1 t1
join table2 t2 on t1.a = t2.a
join table3 t3 on t2.b = t3.b;
这是为什么?我怎样才能使我的第一个查询更快?
编辑:执行计划
mysql> explain delete t1 from table1 t1 join table2 t2 on t1.a = t2.a join table3 t3 on t2.b = t3.b;
+----+-------------+-------+------------+-------+--------------------------+----------+---------+----------+------+----------+-------------+
| id | select_type | table | partitions | type | possible_keys | key | key_len | ref | rows | filtered | Extra |
+----+-------------+-------+------------+-------+--------------------------+----------+---------+----------+------+----------+-------------+
| 1 | SIMPLE | t2 | NULL | index | PRIMARY | b | 257 | NULL | 1 | 100.00 | Using index |
| 1 | DELETE | t1 | NULL | ref | a,FK2354764DB4B32 | a | 8 | db.t2.a | 1 | 100.00 | NULL |
| 1 | SIMPLE | t3 | NULL | ALL | NULL | NULL | NULL | NULL | 5000 | 10.00 | Using where |
+----+-------------+-------+------------+-------+--------------------------+----------+---------+----------+------+----------+-------------+
edit2:存在 select 的另一个尝试
mysql> explain delete from table1 t1 where exists (select 1 from table2 t2 where t2.a = t1.a and exists (select 1 from table3 t3 where t3.b = t2.b));
+----+--------------------+-------+------------+--------+---------------+---------+---------+---------------+------+----------+-------------------------------------------------------------------+
| id | select_type | table | partitions | type | possible_keys | key | key_len | ref | rows | filtered | Extra |
+----+--------------------+-------+------------+--------+---------------+---------+---------+---------------+------+----------+-------------------------------------------------------------------+
| 1 | DELETE | t1 | NULL | ALL | NULL | NULL | NULL | NULL | 10000| 100.00 | Using where |
| 2 | DEPENDENT SUBQUERY | t2 | NULL | eq_ref | PRIMARY | PRIMARY | 8 | db.t1.a | 1 | 100.00 | NULL |
| 2 | DEPENDENT SUBQUERY | t3 | NULL | ALL | NULL | NULL | NULL | NULL | 5000 | 10.00 | Using where; FirstMatch(t2); Using join buffer (Block Nested Loop)|
+----+--------------------+-------+------------+--------+---------------+---------+---------+---------------+------+----------+-------------------------------------------------------------------+
3 rows in set, 3 warnings (0.00 sec)
谢谢
delete from table1 t1
WHERE exists (
select 1 from table2 t2
where t1.a = t2.a
and exists ( select 1 from table3 t3 where t2.b = t3.b)
;
并确保 t1.a, t2.a, t2.b, t3.b
已编入索引。
此外,如果您要删除 table 上的大部分数据,我建议您导出那些不会被删除的记录并截断源 table。之后就可以导入导出的记录了。
delete from t1
where t1.a in (select distinct t2.a from t2 inner join t3 on t2.b = t3.b)
如果您要删除 table 中的大量行,将要保留的行移动到另一个 table 中通常会更快,然后截断并重新加载原始行table:
-- select the rows we want to keep into a new table
create table tmptable as
select *
from table1 t1
where not exists (
select 1
from table2 t2
inner join table3 t3 on t3.b = t2.b
where t2.a = t1.a
);
-- empty the original table
truncate table table1; -- !! back it up first !!
-- reload it
insert into table1 select * from tmptable;
-- done
drop table tmptable;