在 plpgsql 中生成唯一的随机字符串

Generate unique random strings in plpgsql

我正在尝试编写一个函数来创建可变长度的唯一随机标记。但是,我被 plpgsql 语法难住了。我的意图是创建一个函数

我目前的尝试是这样的:

CREATE FUNCTION random_token(_table TEXT, _column TEXT, _length INTEGER) RETURNS text AS $$
DECLARE
  alphanum CONSTANT text := 'abcdefghijkmnopqrstuvwxyz23456789';
  range_head CONSTANT integer := 25;
  range_tail CONSTANT integer := 33;
  random_string text;
BEGIN
  REPEAT
    SELECT substring(alphanum from trunc(random() * range_head + 1)::integer for 1) ||
      array_to_string(array_agg(substring(alphanum from trunc(random() * range_tail + 1)::integer for 1)), '')
      INTO random_string FROM generate_series(1, _length - 1);
  UNTIL random_string NOT IN FORMAT('SELECT %I FROM %I WHERE %I = random_string;', _column, _table, _column)
  END REPEAT;
  RETURN random_string;
END
$$ LANGUAGE plpgsql;

但是,这不起作用,并给了我一个不是很有帮助的错误:

DatabaseError: error 'ERROR: syntax error at or near "REPEAT"

我尝试了多种变体,但不知道语法中的错误是什么,我感到很困惑。知道如何修复此功能吗?

plpgsql中没有repeat语句。使用简单的 loop.

CREATE OR REPLACE FUNCTION random_token(_table TEXT, _column TEXT, _length INTEGER) RETURNS text AS $$
DECLARE
  alphanum CONSTANT text := 'abcdefghijkmnopqrstuvwxyz23456789';
  range_head CONSTANT integer := 25;
  range_tail CONSTANT integer := 33;
  random_string text;
  ct int;
BEGIN
  LOOP
    SELECT substring(alphanum from trunc(random() * range_head + 1)::integer for 1) ||
      array_to_string(array_agg(substring(alphanum from trunc(random() * range_tail + 1)::integer for 1)), '')
      INTO random_string FROM generate_series(1, _length - 1);
    EXECUTE FORMAT('SELECT count(*) FROM %I WHERE %I = %L', _table, _column, random_string) INTO ct;
    EXIT WHEN ct = 0;
  END LOOP;
  RETURN random_string;
END
$$ LANGUAGE plpgsql;

注意,random_string 应该是 format() 的参数。

更新。根据 Abelisto 的准确提示,对于大 table:

这应该更快
DECLARE
  dup boolean;
...
    EXECUTE FORMAT('SELECT EXISTS(SELECT 1 FROM %I WHERE %I = %L)', _table, _column, random_string) INTO dup;
    EXIT WHEN NOT dup;
...

这几乎肯定不是您想要的。当您说 "checks if the string is already in the column" 时,您指的不是看起来很独特的东西,而是实际上是 UNIQUE 的东西。

Instead, I would point you over this answer I gave about UUIDs.