mirror of
https://git.postgresql.org/git/postgresql.git
synced 2024-12-21 08:29:39 +08:00
220 lines
7.2 KiB
Plaintext
220 lines
7.2 KiB
Plaintext
.\" This is -*-nroff-*-
|
|
.\" XXX standard disclaimer belongs here....
|
|
.\" $Header: /cvsroot/pgsql/doc/man/Attic/create_operator.l,v 1.1.1.1 1996/08/18 22:14:21 scrappy Exp $
|
|
.TH "CREATE OPERATOR" SQL 11/05/95 Postgres95 Postgres95
|
|
.SH NAME
|
|
create operator \(em define a new user operator
|
|
.SH SYNOPSIS
|
|
.nf
|
|
\fBcreate operator\fR operator_name
|
|
\fB(\fR[ \fBleftarg\fR \fB=\fR type-1 ]
|
|
[ \fB,\fR \fBrightarg\fR \fB=\fR type-2 ]
|
|
, \fBprocedure =\fR func_name
|
|
[\fB, commutator =\fR com_op ]
|
|
[\fB, negator =\fR neg_op ]
|
|
[\fB, restrict =\fR res_proc ]
|
|
[\fB, hashes\fR]
|
|
[\fB, join =\fR join_proc ]
|
|
[\fB, sort =\fR sor_op1 {\fB,\fR sor_op2 } ]
|
|
\fB)\fR
|
|
.\" \fB"arg is ("
|
|
.\" type [
|
|
.\" \fB,
|
|
.\" type ]
|
|
.\" \fB)
|
|
.fi
|
|
.SH DESCRIPTION
|
|
This command defines a new user operator,
|
|
.IR "operator_name" .
|
|
The user who defines an operator becomes its owner.
|
|
.PP
|
|
The
|
|
.IR "operator_name"
|
|
is a sequence of up to sixteen punctuation characters. The following
|
|
characters are valid for single-character operator names:
|
|
.nf
|
|
~ ! @ # % ^ & ` ?
|
|
.fi
|
|
If the operator name is more than one character long, it may consist
|
|
of any combination of the above characters or the following additional
|
|
characters:
|
|
.nf
|
|
| $ : + - * / < > =
|
|
.fi
|
|
.PP
|
|
At least one of
|
|
.IR leftarg
|
|
and
|
|
.IR rightarg
|
|
must be defined. For binary operators, both should be defined. For
|
|
right unary operators, only
|
|
.IR arg1
|
|
should be defined, while for left unary operators only
|
|
.IR arg2
|
|
should be defined.
|
|
.PP
|
|
The name of the operator,
|
|
.IR operator_name ,
|
|
can be composed of symbols only. Also, the
|
|
.IR func_name
|
|
procedure must have been previously defined using
|
|
.IR "create function" (l)
|
|
and must have one or two arguments.
|
|
.PP
|
|
.\" that multiple instances of the
|
|
.\" operator must be be evaluated
|
|
.\" For example, consider the area-intersection operator,
|
|
.\" .q A,
|
|
.\" and the following expression:
|
|
.\" .(l
|
|
.\" MYBOXES2.description A \*(lq0,0,1,1\*(rq A MYBOXES.description
|
|
.\" .)l
|
|
.\" .in .5i
|
|
.\" The associativity flag indicates that
|
|
.\" .(l
|
|
.\" (MYBOXES2.description A \*(lq0,0,1,1\*(rq) A MYBOXES.description
|
|
.\" .)l
|
|
.\" .in .5i
|
|
.\" is the same as
|
|
.\" .(l
|
|
.\" MYBOXES2.description A (\*(lq0,0,1,1\*(rq A MYBOXES.description).
|
|
.\" .)l
|
|
The commutator operator is present so that Postgres can reverse the order
|
|
of the operands if it wishes. For example, the operator
|
|
area-less-than, >>>, would have a commutator operator,
|
|
area-greater-than, <<<. Suppose that an operator, area-equal, ===,
|
|
exists, as well as an area not equal, !==. Hence, the query optimizer
|
|
could freely convert:
|
|
.nf
|
|
"0,0,1,1"::box >>> MYBOXES.description
|
|
.fi
|
|
to
|
|
.nf
|
|
MYBOXES.description <<< "0,0,1,1"::box
|
|
.fi
|
|
This allows the execution code to always use the latter representation
|
|
and simplifies the query optimizer somewhat.
|
|
.PP
|
|
The negator operator allows the query optimizer to convert
|
|
.nf
|
|
not MYBOXES.description === "0,0,1,1"::box
|
|
.fi
|
|
to
|
|
.nf
|
|
MYBOXES.description !== "0,0,1,1"::box
|
|
.fi
|
|
If a commutator operator name is supplied, Postgres searches for it in
|
|
the catalog. If it is found and it does not yet have a commutator
|
|
itself, then the commutator's entry is updated to have the current
|
|
(new) operator as its commutator. This applies to the negator, as
|
|
well.
|
|
.PP
|
|
This is to allow the definition of two operators that are the
|
|
commutators or the negators of each other. The first operator should
|
|
be defined without a commutator or negator (as appropriate). When the
|
|
second operator is defined, name the first as the commutator or
|
|
negator. The first will be updated as a side effect.
|
|
.PP
|
|
The next two specifications are present to support the query optimizer
|
|
in performing joins. Postgres can always evaluate a join (i.e.,
|
|
processing a clause with two tuple variables separated by an operator
|
|
that returns a boolean) by iterative substitution [WONG76]. In
|
|
addition, Postgres is planning on implementing a hash-join algorithm
|
|
along the lines of [SHAP86]; however, it must know whether this
|
|
strategy is applicable. For example, a hash-join algorithm is usable
|
|
for a clause of the form:
|
|
.nf
|
|
MYBOXES.description === MYBOXES2.description
|
|
.fi
|
|
but not for a clause of the form:
|
|
.nf
|
|
MYBOXES.description <<< MYBOXES2.description.
|
|
.fi
|
|
The
|
|
.BR hashes
|
|
flag gives the needed information to the query optimizer concerning
|
|
whether a hash join strategy is usable for the operator in question.
|
|
.PP
|
|
Similarly, the two sort operators indicate to the query optimizer
|
|
whether merge-sort is a usable join strategy and what operators should
|
|
be used to sort the two operand classes. For the === clause above,
|
|
the optimizer must sort both relations using the operator, <<<. On
|
|
the other hand, merge-sort is not usable with the clause:
|
|
.nf
|
|
MYBOXES.description <<< MYBOXES2.description
|
|
.fi
|
|
If other join strategies are found to be practical, Postgres will change
|
|
the optimizer and run-time system to use them and will require
|
|
additional specification when an operator is defined. Fortunately,
|
|
the research community invents new join strategies infrequently, and
|
|
the added generality of user-defined join strategies was not felt to
|
|
be worth the complexity involved.
|
|
.PP
|
|
The last two pieces of the specification are present so the query
|
|
optimizer can estimate result sizes. If a clause of the form:
|
|
.nf
|
|
MYBOXES.description <<< "0,0,1,1"::box
|
|
.fi
|
|
is present in the qualification, then Postgres may have to estimate the
|
|
fraction of the instances in MYBOXES that satisfy the clause. The
|
|
function res_proc must be a registered function (meaning it is already
|
|
defined using
|
|
.IR "define function" (l))
|
|
which accepts one argument of the correct data type and returns a
|
|
floating point number. The query optimizer simply calls this
|
|
function, passing the parameter
|
|
.nf
|
|
"0,0,1,1"
|
|
.fi
|
|
and multiplies the result by the relation size to get the desired
|
|
expected number of instances.
|
|
.PP
|
|
Similarly, when the operands of the operator both contain instance
|
|
variables, the query optimizer must estimate the size of the resulting
|
|
join. The function join_proc will return another floating point
|
|
number which will be multiplied by the cardinalities of the two
|
|
classes involved to compute the desired expected result size.
|
|
.PP
|
|
The difference between the function
|
|
.nf
|
|
my_procedure_1 (MYBOXES.description, "0,0,1,1"::box)
|
|
.fi
|
|
and the operator
|
|
.nf
|
|
MYBOXES.description === "0,0,1,1"::box
|
|
.fi
|
|
is that Postgres attempts to optimize operators and can decide to use an
|
|
index to restrict the search space when operators are involved.
|
|
However, there is no attempt to optimize functions, and they are
|
|
performed by brute force. Moreover, functions can have any number of
|
|
arguments while operators are restricted to one or two.
|
|
.SH EXAMPLE
|
|
.nf
|
|
--
|
|
--The following command defines a new operator,
|
|
--area-equality, for the BOX data type.
|
|
--
|
|
create operator === (
|
|
leftarg = box,
|
|
rightarg = box,
|
|
procedure = area_equal_procedure,
|
|
commutator = ===,
|
|
negator = !==,
|
|
restrict = area_restriction_procedure,
|
|
hashes,
|
|
join = area-join-procedure,
|
|
sort = <<<, <<<)
|
|
.\" arg is (box, box)
|
|
.fi
|
|
.SH "SEE ALSO"
|
|
create function(l),
|
|
drop operator(l).
|
|
.SH BUGS
|
|
Operator names cannot be composed of alphabetic characters in
|
|
Postgres.
|
|
.PP
|
|
If an operator is defined before its commuting operator has been defined
|
|
(a case specifically warned against above), a dummy operator with invalid
|
|
fields will be placed in the system catalogs. This may interfere with
|
|
the definition of later operators.
|