mirror of
https://git.postgresql.org/git/postgresql.git
synced 2025-01-24 18:55:04 +08:00
5709 lines
228 KiB
Plaintext
5709 lines
228 KiB
Plaintext
From owner-pgsql-hackers@hub.org Tue Oct 13 15:05:53 1998
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id PAA09435
|
|
for <maillist@candle.pha.pa.us>; Tue, 13 Oct 1998 15:05:50 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id OAA11700;
|
|
Tue, 13 Oct 1998 14:43:31 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Tue, 13 Oct 1998 14:41:03 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id OAA11395
|
|
for pgsql-hackers-outgoing; Tue, 13 Oct 1998 14:41:00 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from terry1.acun.com (terry@terry1.acun.com [206.27.86.12])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id OAA11372
|
|
for <hackers@postgreSQL.org>; Tue, 13 Oct 1998 14:40:54 -0400 (EDT)
|
|
(envelope-from terry@terrym.com)
|
|
Received: from localhost (terry@localhost)
|
|
by terry1.acun.com (8.8.5/8.8.5) with SMTP id OAA09491
|
|
for <hackers@postgreSQL.org>; Tue, 13 Oct 1998 14:53:22 -0400
|
|
Date: Tue, 13 Oct 1998 14:53:22 -0400 (EDT)
|
|
From: Terry Mackintosh <terry@terrym.com>
|
|
X-Sender: terry@terry1.acun.com
|
|
Reply-To: Terry Mackintosh <terry@terrym.com>
|
|
To: PostgreSQL-development <hackers@postgreSQL.org>
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
In-Reply-To: <Pine.GSO.3.96.SK.981013211058.17758A-100000@ra>
|
|
Message-ID: <Pine.LNX.3.95.981013141634.9255C-100000@terry1.acun.com>
|
|
MIME-Version: 1.0
|
|
Content-Type: TEXT/PLAIN; charset=US-ASCII
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: ROr
|
|
|
|
Hi, my 2 cents...
|
|
|
|
I agree completely, LIMIT would be VERY usefull in web based apps, which
|
|
is all I run. It does not matter to me if it is not part of a formal
|
|
standard. The idea is so common that it is a defacto standard.
|
|
|
|
I would not expect it for this release, but could it get put on the TODO
|
|
list for next time? I am even willing to work at an apprentise level on
|
|
this with a more expeireanced person that knows this stuff.
|
|
|
|
A note on implimentation:
|
|
I *used to* :) work with VFP on NT's :(
|
|
And the way VFP did LIMIT, it would only return the number of rows asked
|
|
for, BUT it still did the WHOLE search!
|
|
So on a larger table, which we had (property tax database for the county),
|
|
if some one put in too vague a query, it would try to collect ALL of the
|
|
rows as the initial result set, then give you the first x rows of that.
|
|
|
|
This did save on pushing mass amounts of data out to the browser, but it
|
|
would have been even better if it could have simply aborted the select
|
|
after having found x rows.
|
|
|
|
Also, it did not have the concept of an offset, so one could not select
|
|
100 rows, starting 200 rows in, which would be REALLY usefull for "paging"
|
|
through data. I do not know if mySQL or any other has such a concept
|
|
either, but it would be nice.
|
|
|
|
So a properly implemented "LIMIT" could:
|
|
1. Save pushing mass amounts of data across the web, that no one wants
|
|
any way.
|
|
2. Stop vague queries from bogging down the server.
|
|
(On very larg tables this could be critical!)
|
|
3. Enable "Paging" of data. (easyer then now (app. level))
|
|
4. Would be a very nice feather in PostgreSQL's cap that could make it
|
|
even more attractive to those looking at all sorts of databases out there.
|
|
|
|
Have a great day.
|
|
|
|
On Tue, 13 Oct 1998, Oleg Bartunov wrote:
|
|
|
|
> Hi,
|
|
>
|
|
> I took a look at mysql and was very impressed with possibility
|
|
> to limit number of rows returned from select. This is very useful
|
|
> feature for Web applications when user need to browse results of
|
|
> selection page by page. In my application I have to do full
|
|
> select every time user press button [Next] and show requested page
|
|
> using perl. This works more or less ok for several thousands rows but
|
|
> totally unusable for large selections. But now I'm about to work
|
|
> with big database and I don't know how I'll stay with postgres :-)
|
|
> It'll just doesn't work if customer will wait several minutes just browse
|
|
> next page. Mysql lacks some useful features postgres has
|
|
> (subselects, transaction ..) but for most Web applications I need
|
|
> just select :-) I dont' know how LIMIT is implemented in Mysql and
|
|
> I know it's not in SQL92 standart, but this makes Mysql very popular.
|
|
>
|
|
> Is it difficult to implement this feature in postgres ?
|
|
>
|
|
> Regards,
|
|
>
|
|
> Oleg
|
|
>
|
|
>
|
|
> _____________________________________________________________
|
|
> Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
|
|
> Sternberg Astronomical Institute, Moscow University (Russia)
|
|
> Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
|
|
> phone: +007(095)939-16-83, +007(095)939-23-83
|
|
>
|
|
>
|
|
|
|
Terry Mackintosh <terry@terrym.com> http://www.terrym.com
|
|
sysadmin/owner Please! No MIME encoded or HTML mail, unless needed.
|
|
|
|
Proudly powered by R H Linux 4.2, Apache 1.3, PHP 3, PostgreSQL 6.3
|
|
-------------------------------------------------------------------
|
|
Success Is A Choice ... book by Rick Patino, get it, read it!
|
|
|
|
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Tue Oct 13 18:12:41 1998
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id SAA12156
|
|
for <maillist@candle.pha.pa.us>; Tue, 13 Oct 1998 18:12:39 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id RAA04181;
|
|
Tue, 13 Oct 1998 17:56:17 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Tue, 13 Oct 1998 17:54:49 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id RAA03869
|
|
for pgsql-hackers-outgoing; Tue, 13 Oct 1998 17:54:47 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from remapcorp.com (root@remapcorp.com [206.196.37.193])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id RAA03838
|
|
for <hackers@postgreSQL.org>; Tue, 13 Oct 1998 17:54:36 -0400 (EDT)
|
|
(envelope-from jeff@remapcorp.com)
|
|
Received: from go-to-jail (gotojail.remapcorp.com [206.196.37.197])
|
|
by remapcorp.com (8.8.7/8.8.7) with SMTP id QAA25337;
|
|
Tue, 13 Oct 1998 16:55:35 -0500 (CDT)
|
|
(envelope-from jeff@remapcorp.com)
|
|
Message-ID: <006701bdf6f4$60ed75f0$c525c4ce@go-to-jail.remapcorp.com>
|
|
From: "Jeff Hoffmann" <jeff@remapcorp.com>
|
|
To: "Marc G. Fournier" <scrappy@hub.org>, "Eric Lee Green" <eric@linux-hw.com>
|
|
Cc: "PostgreSQL-development" <hackers@postgreSQL.org>
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
Date: Tue, 13 Oct 1998 16:56:48 -0500
|
|
MIME-Version: 1.0
|
|
Content-Type: text/plain;
|
|
charset="iso-8859-1"
|
|
Content-Transfer-Encoding: 7bit
|
|
X-Priority: 3
|
|
X-MSMail-Priority: Normal
|
|
X-Mailer: Microsoft Outlook Express 4.72.3115.0
|
|
X-MimeOLE: Produced By Microsoft MimeOLE V4.72.3110.3
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: RO
|
|
|
|
>On Tue, 13 Oct 1998, Eric Lee Green wrote:
|
|
>
|
|
>> On Tue, 13 Oct 1998, Jeff Hoffmann wrote:
|
|
>> > >I agree completely, LIMIT would be VERY usefull in web based apps,
|
|
which
|
|
>> > >is all I run. It does not matter to me if it is not part of a formal
|
|
>> > >standard. The idea is so common that it is a defacto standard.
|
|
>> >
|
|
>> > i'm not familiar with mysql and using "LIMIT" but wouldn't this same
|
|
effect
|
|
>> > be achieved by declaring a cursor and fetching however many records in
|
|
the
|
|
>> > cursor? it's a very noticeable improvement when you only want the
|
|
first 20
|
|
>> > out of 500 in a 200k record database, at least.
|
|
>>
|
|
>> The problem with declaring a cursor vs. the "LIMIT" clause is that the
|
|
>> "LIMIT" clause, if used properly by the database engine (along with the
|
|
>> database engine using indexes in "ORDER BY" clauses) allows the database
|
|
>> engine to short-circuit the tail end of the query. That is, if you have
|
|
25
|
|
>> names and the last one ends with BEAVIS, the database engine doesn't have
|
|
>> to go through the BUTTHEADS and KENNYs and etc.
|
|
>>
|
|
>> Theoretically a cursor is superior to the "LIMIT" clause because you're
|
|
>> eventually going to want the B's and K's and etc. anyhow -- but only in a
|
|
>> stateful enviornment. In the stateless web environment, a cursor is
|
|
>> useless because the connection can close at any time even when you're
|
|
>> using "persistent" connections (and of course when the connection closes
|
|
>> the cursor closes).
|
|
>
|
|
>Ookay, I'm sorry, butyou lost me here. I haven't gotten into using
|
|
>CURSORs/FETCHs yet, since I haven't need it...but can you give an example
|
|
>of what you would want to do using a LIMIT? I may be missing something,
|
|
>but wha is the different between using LIMIT to get X records, and
|
|
>definiing a cursor to FETCH X records?
|
|
>
|
|
>Practical example of *at least* the LIMIT side would be good, so that we
|
|
>can at least see a physical example of what LIMIT can do that
|
|
>CURSORs/FETCH can't...
|
|
>
|
|
|
|
|
|
fetch with cursors should work properly (i.e., you can short circuit it by
|
|
just ending your transaction) my understanding on how this works is exactly
|
|
how you explained LIMIT to work. here's some empirical proof from one of my
|
|
sample databases:
|
|
|
|
the sample table i'm using has 156k records (names of people)
|
|
i'm using a PP180 with 128MB RAM and some old slow SCSI drives.
|
|
|
|
public_mn=> select count(*) from public_ramsey;
|
|
count
|
|
------
|
|
156566
|
|
(1 row)
|
|
|
|
i did the following query:
|
|
public_mn=> select * from public_ramsey where ownerlname ~ 'SMITH';
|
|
|
|
which returned 711 matches and took about 12 seconds.
|
|
|
|
i did the same thing with a cursor:
|
|
|
|
public_mn=> begin;
|
|
BEGIN
|
|
public_mn=> declare test cursor for select * from public_ramsey where
|
|
ownerlname ~ 'SMITH';
|
|
SELECT
|
|
|
|
the select was instantaneous.
|
|
|
|
public_mn=> fetch 20 in test;
|
|
|
|
returns 20 records almost instantaneously. each additional 20 took less
|
|
than a second, as well.
|
|
|
|
if this isn't what you're talking about, i don't understand what you're
|
|
saying.
|
|
|
|
jeff
|
|
|
|
|
|
|
|
From eric@ireland.linux-hw.com Tue Oct 13 18:52:42 1998
|
|
Received: from ireland.linux-hw.com (IDENT:eric@ireland.linux-hw.com [199.72.95.215])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id SAA12388
|
|
for <maillist@candle.pha.pa.us>; Tue, 13 Oct 1998 18:52:40 -0400 (EDT)
|
|
Received: from localhost (eric@localhost)
|
|
by ireland.linux-hw.com (8.8.7/8.8.7) with SMTP id SAA31316;
|
|
Tue, 13 Oct 1998 18:55:22 -0400
|
|
Date: Tue, 13 Oct 1998 18:55:22 -0400 (EDT)
|
|
From: Eric Lee Green <eric@linux-hw.com>
|
|
To: Bruce Momjian <maillist@candle.pha.pa.us>
|
|
cc: jeff@remapcorp.com, hackers@postgreSQL.org
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
In-Reply-To: <199810132116.RAA11249@candle.pha.pa.us>
|
|
Message-ID: <Pine.LNX.3.96.981013184022.31202B-100000@ireland.linux-hw.com>
|
|
MIME-Version: 1.0
|
|
Content-Type: TEXT/PLAIN; charset=US-ASCII
|
|
Status: RO
|
|
|
|
On Tue, 13 Oct 1998, Bruce Momjian wrote:
|
|
> > Theoretically a cursor is superior to the "LIMIT" clause because you're
|
|
> > eventually going to want the B's and K's and etc. anyhow -- but only in a
|
|
> > stateful enviornment. In the stateless web environment, a cursor is
|
|
> > useless because the connection can close at any time even when you're
|
|
> > using "persistent" connections (and of course when the connection closes
|
|
> What we could do is _if_ there is only one table(no joins), and an index
|
|
> exists that matches the ORDER BY, we could use the index to
|
|
> short-circuit the query.
|
|
|
|
This is exactly what MySQL does in this situation, except that it can use
|
|
the ORDER BY to do the short circuiting even if there is a join involved
|
|
if all of the elements of the ORDER BY belong to one table. Obviously if
|
|
I'm doing an "ORDER BY table1.foo table2.bar" that isn't going to work!
|
|
But "select table1.fsname,table1.lname,table2.receivables where
|
|
table2.receivables > 0 and table1.custnum=table2.custnum order by
|
|
(table1.lname,table1.fsname) limit 50" can be short-circuited by fiddling
|
|
with the join order -- table1.fsname table1.lname have to be the first two
|
|
things in the join order.
|
|
|
|
Whether this is feasible in PostgreSQL I have no earthly idea. This would
|
|
seem to conflict with the join optimizer.
|
|
|
|
> happier? If there is an ORDER BY and no index, or a join, I can't
|
|
> figure out how we would short-circuit the query.
|
|
|
|
If there is an ORDER BY and no index you can't short-circuit the query.
|
|
MySQL doesn't either. Under certain circumstances (such as above) you can
|
|
short-circuit a join, but it's unclear whether it'd be easy to add such
|
|
a capability to PostgreSQL given the current structure of the query
|
|
optimizer. (And I certainly am not in a position to tackle it, at the
|
|
moment MySQL is sufficing for my project despite the fact that it is
|
|
quite limited compared to PostgreSQL, I need to get my project finished
|
|
first).
|
|
|
|
--
|
|
Eric Lee Green eric@linux-hw.com http://www.linux-hw.com/~eric
|
|
"To call Microsoft an innovator is like calling the Pope Jewish ..."
|
|
-- James Love (Consumer Project on Technology)
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Wed Oct 14 09:01:01 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id JAA24574
|
|
for <maillist@candle.pha.pa.us>; Wed, 14 Oct 1998 09:01:00 -0400 (EDT)
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id HAA17762 for <maillist@candle.pha.pa.us>; Wed, 14 Oct 1998 07:47:57 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id HAA09214;
|
|
Wed, 14 Oct 1998 07:04:59 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Wed, 14 Oct 1998 07:00:44 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id HAA09116
|
|
for pgsql-hackers-outgoing; Wed, 14 Oct 1998 07:00:40 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from dsh.de (firewall-user@neptun.sns-felb.debis.de [53.122.101.2])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id HAA09102
|
|
for <hackers@postgreSQL.org>; Wed, 14 Oct 1998 07:00:27 -0400 (EDT)
|
|
(envelope-from wieck@sapserv.debis.de)
|
|
Received: by dsh.de; id NAA05037; Wed, 14 Oct 1998 13:02:40 +0200 (MET DST)
|
|
Received: from dshmail.dsh.de(53.47.15.3) by neptun.dsh.de via smap (3.2)
|
|
id xma004737; Wed, 14 Oct 98 13:02:09 +0200
|
|
Received: from mail1.hh1.dsh.de (mail1.hh1.dsh.de [53.47.9.5])
|
|
by dshmail.dsh.de (8.8.7/8.8.7) with ESMTP id MAA20155;
|
|
Wed, 14 Oct 1998 12:59:23 +0200 (MET DST)
|
|
Received: from mars.SAPserv.Hamburg.dsh.de (root@mail5.hh1.dsh.de [53.2.168.17])
|
|
by mail1.hh1.dsh.de (8.8.7/8.8.7) with SMTP id NAA20772;
|
|
Wed, 14 Oct 1998 13:01:35 +0200
|
|
Received: from orion.SAPserv.Hamburg.dsh.de
|
|
by mars.SAPserv.Hamburg.dsh.de with smtp
|
|
for <<eric@linux-hw.com>>
|
|
id m0zTMGL-000B5AC; Wed, 14 Oct 98 10:26 MET DST
|
|
Received: by orion.SAPserv.Hamburg.dsh.de
|
|
for eric@linux-hw.com
|
|
id m0zTOnx-000EBRC; Wed, 14 Oct 98 13:09 MET DST
|
|
Message-Id: <m0zTOnx-000EBRC@orion.SAPserv.Hamburg.dsh.de>
|
|
From: jwieck@debis.com (Jan Wieck)
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
To: eric@linux-hw.com (Eric Lee Green)
|
|
Date: Wed, 14 Oct 1998 13:09:21 +0200 (MET DST)
|
|
Cc: jeff@remapcorp.com, hackers@postgreSQL.org
|
|
Reply-To: jwieck@debis.com (Jan Wieck)
|
|
In-Reply-To: <Pine.LNX.3.96.981013161955.30555A-100000@ireland.linux-hw.com> from "Eric Lee Green" at Oct 13, 98 04:24:20 pm
|
|
X-Mailer: ELM [version 2.4 PL25]
|
|
Content-Type: text
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: ROr
|
|
|
|
Eric Lee Green wrote:
|
|
>
|
|
> On Tue, 13 Oct 1998, Jeff Hoffmann wrote:
|
|
> > >I agree completely, LIMIT would be VERY usefull in web based apps, which
|
|
> > >is all I run. It does not matter to me if it is not part of a formal
|
|
> > >standard. The idea is so common that it is a defacto standard.
|
|
> >
|
|
> > i'm not familiar with mysql and using "LIMIT" but wouldn't this same effect
|
|
> > be achieved by declaring a cursor and fetching however many records in the
|
|
> > cursor? it's a very noticeable improvement when you only want the first 20
|
|
> > out of 500 in a 200k record database, at least.
|
|
>
|
|
> The problem with declaring a cursor vs. the "LIMIT" clause is that the
|
|
> "LIMIT" clause, if used properly by the database engine (along with the
|
|
> database engine using indexes in "ORDER BY" clauses) allows the database
|
|
> engine to short-circuit the tail end of the query. That is, if you have 25
|
|
> names and the last one ends with BEAVIS, the database engine doesn't have
|
|
> to go through the BUTTHEADS and KENNYs and etc.
|
|
>
|
|
> Theoretically a cursor is superior to the "LIMIT" clause because you're
|
|
> eventually going to want the B's and K's and etc. anyhow -- but only in a
|
|
> stateful enviornment. In the stateless web environment, a cursor is
|
|
> useless because the connection can close at any time even when you're
|
|
> using "persistent" connections (and of course when the connection closes
|
|
> the cursor closes).
|
|
|
|
I'm missing something. Well it's right that in the stateless
|
|
web environment a cursor has to be declared and closed for
|
|
any single CGI call. But even if you have a LIMIT clause,
|
|
your CGI must know with which key to start.
|
|
|
|
So your query must look like
|
|
|
|
SELECT ... WHERE key > 'last processed key' ORDER BY key;
|
|
|
|
And your key must be unique (or at least contain no duplicate
|
|
entries) or you might miss some rows between the pages (have
|
|
100 Brown's in the table and last processed key was a Brown
|
|
while using LIMIT).
|
|
|
|
In postgres you could actually do the following (but read on
|
|
below - it's not optimized correct)
|
|
|
|
BEGIN;
|
|
DECLARE c CURSOR FOR SELECT ... WHERE key > 'last' ORDER BY key;
|
|
FETCH 20 IN c;
|
|
(process the 20 rows in CGI)
|
|
CLOSE c;
|
|
COMMIT;
|
|
|
|
Having LIMIT looks more elegant and has less overhead in CGI-
|
|
backend communication. But the cursor version is SQL
|
|
standard and portable.
|
|
|
|
>
|
|
> I wanted very badly to use PostgreSQL for a web project I'm working on,
|
|
> but it just wouldn't do the job :-(.
|
|
|
|
I've done some tests and what I found out might be a bug in
|
|
PostgreSQL's query optimizer. Having a table with 25k rows
|
|
where key is a text field with a unique index. Now I used
|
|
EXPLAIN for some queries
|
|
|
|
SELECT * FROM tab;
|
|
|
|
results in a seqscan - expected.
|
|
|
|
SELECT * FROM tab ORDER BY key;
|
|
|
|
results in a sort->seqscan - I would have
|
|
expected an indexscan!
|
|
|
|
SELECT * FROM tab WHERE key > 'G';
|
|
|
|
results in an indexscan - expected.
|
|
|
|
SELECT * FROM tab WHERE key > 'G' ORDER BY key;
|
|
|
|
results in a sort->indexscan - hmmm.
|
|
|
|
These results stay the same even if I blow up the table by
|
|
duplicating all rows (now with a non-unique index) to 100k
|
|
rows and have them presorted in the table.
|
|
|
|
Needless to say that everything is vacuum'd for statistics.
|
|
|
|
The last one is the query we would need in the web
|
|
environment used over a cursor as in the example above. But
|
|
due to the sort, the backend selects until the end of the
|
|
table, sorts them and then returns only the first 20 rows
|
|
(out of sorts result).
|
|
|
|
This is very painful if the qualification (key > ...) points
|
|
to the beginning of the key list.
|
|
|
|
Looking at planner.c I can see, that if there is a sortClause
|
|
in the parsetree, the planner creates a sort node and does
|
|
absolutely not check if there is an index that could be used
|
|
to do it. In the examples above, the sort is absolutely
|
|
needless because the index scan will already return the
|
|
tuples in the right order :-).
|
|
|
|
Somewhere deep in my brain I found a statement that sorting
|
|
sorted data isn't only unnecessary (except the order
|
|
changes), it is slow too compared against sorting randomly
|
|
ordered data.
|
|
|
|
Can we fix this before 6.4 release, will it be a past 6.4 or
|
|
am I doing something wrong here? I think it isn't a fix (it's
|
|
a planner enhancement) so it should really be a past 6.4
|
|
item.
|
|
|
|
For now, the only possibility is to omit the ORDER BY in the
|
|
query and hope the planner will always generate an index scan
|
|
(because of the qualification 'key > ...'). Doing so I
|
|
selected multiple times 20 rows (with the last key qual like
|
|
a CGI would do) in separate transactions. Using cursor and
|
|
fetch speeds up the access by a factor of 1000! But it is
|
|
unsafe and thus NOT RECOMMENDED! It's only a test if cursors
|
|
can do the LIMIT job - and they could if the planner would do
|
|
a better job.
|
|
|
|
|
|
Jan
|
|
|
|
--
|
|
|
|
#======================================================================#
|
|
# It's easier to get forgiveness for being wrong than for being right. #
|
|
# Let's break this rule - forgive me. #
|
|
#======================================== jwieck@debis.com (Jan Wieck) #
|
|
|
|
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Wed Oct 14 11:02:04 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id LAA25519
|
|
for <maillist@candle.pha.pa.us>; Wed, 14 Oct 1998 11:02:02 -0400 (EDT)
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id JAA24583 for <maillist@candle.pha.pa.us>; Wed, 14 Oct 1998 09:46:21 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id IAA17022;
|
|
Wed, 14 Oct 1998 08:59:20 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Wed, 14 Oct 1998 08:54:40 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id IAA16687
|
|
for pgsql-hackers-outgoing; Wed, 14 Oct 1998 08:54:34 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from ra.sai.msu.su (ra.sai.msu.su [158.250.29.2])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id IAA16656
|
|
for <hackers@postgreSQL.org>; Wed, 14 Oct 1998 08:54:00 -0400 (EDT)
|
|
(envelope-from oleg@sai.msu.su)
|
|
Received: from ra (ra [158.250.29.2])
|
|
by ra.sai.msu.su (8.9.1/8.9.1) with SMTP id PAA11714;
|
|
Wed, 14 Oct 1998 15:53:53 +0300 (MSK)
|
|
Date: Wed, 14 Oct 1998 16:53:53 +0400 (MSD)
|
|
From: Oleg Bartunov <oleg@sai.msu.su>
|
|
X-Sender: megera@ra
|
|
Reply-To: Oleg Bartunov <oleg@sai.msu.su>
|
|
To: hackers@postgreSQL.org
|
|
cc: t-ishii@sra.co.jp
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
In-Reply-To: <m0zTOnx-000EBRC@orion.SAPserv.Hamburg.dsh.de>
|
|
Message-ID: <Pine.GSO.3.96.SK.981014163020.10948B-100000@ra>
|
|
Organization: Sternberg Astronomical Institute (Moscow University)
|
|
MIME-Version: 1.0
|
|
Content-Type: TEXT/PLAIN; charset=US-ASCII
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: RO
|
|
|
|
On Wed, 14 Oct 1998, Jan Wieck wrote:
|
|
|
|
> Date: Wed, 14 Oct 1998 13:09:21 +0200 (MET DST)
|
|
> From: Jan Wieck <jwieck@debis.com>
|
|
> To: Eric Lee Green <eric@linux-hw.com>
|
|
> Cc: jeff@remapcorp.com, hackers@postgreSQL.org
|
|
> Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
>
|
|
> Eric Lee Green wrote:
|
|
> >
|
|
> > On Tue, 13 Oct 1998, Jeff Hoffmann wrote:
|
|
> > > >I agree completely, LIMIT would be VERY usefull in web based apps, which
|
|
> > > >is all I run. It does not matter to me if it is not part of a formal
|
|
> > > >standard. The idea is so common that it is a defacto standard.
|
|
> > >
|
|
> > > i'm not familiar with mysql and using "LIMIT" but wouldn't this same effect
|
|
> > > be achieved by declaring a cursor and fetching however many records in the
|
|
> > > cursor? it's a very noticeable improvement when you only want the first 20
|
|
> > > out of 500 in a 200k record database, at least.
|
|
> >
|
|
> > The problem with declaring a cursor vs. the "LIMIT" clause is that the
|
|
> > "LIMIT" clause, if used properly by the database engine (along with the
|
|
> > database engine using indexes in "ORDER BY" clauses) allows the database
|
|
> > engine to short-circuit the tail end of the query. That is, if you have 25
|
|
> > names and the last one ends with BEAVIS, the database engine doesn't have
|
|
> > to go through the BUTTHEADS and KENNYs and etc.
|
|
> >
|
|
> > Theoretically a cursor is superior to the "LIMIT" clause because you're
|
|
> > eventually going to want the B's and K's and etc. anyhow -- but only in a
|
|
> > stateful enviornment. In the stateless web environment, a cursor is
|
|
> > useless because the connection can close at any time even when you're
|
|
> > using "persistent" connections (and of course when the connection closes
|
|
> > the cursor closes).
|
|
>
|
|
> I'm missing something. Well it's right that in the stateless
|
|
> web environment a cursor has to be declared and closed for
|
|
> any single CGI call. But even if you have a LIMIT clause,
|
|
> your CGI must know with which key to start.
|
|
>
|
|
This is not a problem for CGI-script to know which key to start.
|
|
Without LIMIT every CGI call backend will do *FULL* selection
|
|
and cursor helps just in fetching a definite number of rows,
|
|
in principle I can do this with CGI-script. Also, cursor
|
|
returns data back in ASCII format (man l declare) and this requires
|
|
additional job for backend to convert data from intrinsic (binary)
|
|
format. Right implementation of LIMIT offset,number_of_rows could be
|
|
a great win and make postgres superior free database engine for
|
|
Web applications. Many colleagues of mine used mysql instead of
|
|
postgres just because of lacking LIMIT. Tatsuo posted a patch
|
|
for set query_limit to 'num', I just tested it and seems it
|
|
works fine. Now, we need only possibility to specify offset,
|
|
say
|
|
set query_limit to 'offset,num'
|
|
( Tatsuo, How difficult to do this ?)
|
|
and LIMIT problem will ne gone.
|
|
|
|
I'm wonder how many useful patches could be hidden from people :-),
|
|
|
|
Regards,
|
|
|
|
Oleg
|
|
|
|
PS.
|
|
|
|
Tatsuo, do you have patch for 6.3.2 ?
|
|
I can't wait for 6.4 :-)
|
|
_____________________________________________________________
|
|
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
|
|
Sternberg Astronomical Institute, Moscow University (Russia)
|
|
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
|
|
phone: +007(095)939-16-83, +007(095)939-23-83
|
|
|
|
|
|
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Wed Oct 14 11:02:00 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id LAA25510
|
|
for <maillist@candle.pha.pa.us>; Wed, 14 Oct 1998 11:01:59 -0400 (EDT)
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id KAA28854 for <maillist@candle.pha.pa.us>; Wed, 14 Oct 1998 10:40:56 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id KAA21542;
|
|
Wed, 14 Oct 1998 10:03:45 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Wed, 14 Oct 1998 09:59:10 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id JAA21121
|
|
for pgsql-hackers-outgoing; Wed, 14 Oct 1998 09:59:08 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from golem.jpl.nasa.gov (root@hectic-2.jpl.nasa.gov [128.149.68.204])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id JAA21106
|
|
for <hackers@postgreSQL.org>; Wed, 14 Oct 1998 09:59:02 -0400 (EDT)
|
|
(envelope-from lockhart@alumni.caltech.edu)
|
|
Received: from alumni.caltech.edu (localhost [127.0.0.1])
|
|
by golem.jpl.nasa.gov (8.8.5/8.8.5) with ESMTP id NAA19587;
|
|
Wed, 14 Oct 1998 13:59:56 GMT
|
|
Message-ID: <3624AE5C.752E4E7F@alumni.caltech.edu>
|
|
Date: Wed, 14 Oct 1998 13:59:56 +0000
|
|
From: "Thomas G. Lockhart" <lockhart@alumni.caltech.edu>
|
|
Organization: Caltech/JPL
|
|
X-Mailer: Mozilla 4.07 [en] (X11; I; Linux 2.0.30 i686)
|
|
MIME-Version: 1.0
|
|
To: Jan Wieck <jwieck@debis.com>
|
|
CC: Eric Lee Green <eric@linux-hw.com>, jeff@remapcorp.com,
|
|
hackers@postgreSQL.org
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
References: <m0zTOnx-000EBRC@orion.SAPserv.Hamburg.dsh.de>
|
|
Content-Type: text/plain; charset=us-ascii
|
|
Content-Transfer-Encoding: 7bit
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: ROr
|
|
|
|
> I've done some tests and what I found out might be a bug in
|
|
> PostgreSQL's query optimizer.
|
|
> SELECT * FROM tab ORDER BY key;
|
|
> results in a sort->seqscan - I would have
|
|
> expected an indexscan!
|
|
|
|
Given that a table _could_ be completely unsorted on disk, it is
|
|
probably reasonable to suck the data in for a possible in-memory sort
|
|
rather than skipping around the disk to pick up individual tuples via
|
|
the index. Don't know if vacuum has a statistic on "orderness"...
|
|
|
|
> SELECT * FROM tab WHERE key > 'G' ORDER BY key;
|
|
> results in a sort->indexscan - hmmm.
|
|
> The last one is the query we would need in the web
|
|
> environment used over a cursor as in the example above. But
|
|
> due to the sort, the backend selects until the end of the
|
|
> table, sorts them and then returns only the first 20 rows
|
|
> (out of sorts result).
|
|
|
|
So you are saying that for this last case the sort was unnecessary? Does
|
|
the backend traverse the index in the correct order to guarantee that
|
|
the tuples are coming out already sorted? Does a hash index give the
|
|
same plan (I would expect a sort->seqscan for a hash index)?
|
|
|
|
- Tom
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Wed Oct 14 11:01:52 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id LAA25504
|
|
for <maillist@candle.pha.pa.us>; Wed, 14 Oct 1998 11:01:51 -0400 (EDT)
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id KAA00198 for <maillist@candle.pha.pa.us>; Wed, 14 Oct 1998 10:57:15 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id KAA22877;
|
|
Wed, 14 Oct 1998 10:19:47 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Wed, 14 Oct 1998 10:15:44 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id KAA22675
|
|
for pgsql-hackers-outgoing; Wed, 14 Oct 1998 10:15:41 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from dsh.de (firewall-user@neptun.sns-felb.debis.de [53.122.101.2])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id KAA22657
|
|
for <hackers@postgreSQL.org>; Wed, 14 Oct 1998 10:15:32 -0400 (EDT)
|
|
(envelope-from wieck@sapserv.debis.de)
|
|
Received: by dsh.de; id QAA20563; Wed, 14 Oct 1998 16:18:02 +0200 (MET DST)
|
|
Received: from dshmail.dsh.de(53.47.15.3) by neptun.dsh.de via smap (3.2)
|
|
id xma020404; Wed, 14 Oct 98 16:17:25 +0200
|
|
Received: from mail1.hh1.dsh.de (mail1.hh1.dsh.de [53.47.9.5])
|
|
by dshmail.dsh.de (8.8.7/8.8.7) with ESMTP id QAA05077;
|
|
Wed, 14 Oct 1998 16:14:48 +0200 (MET DST)
|
|
Received: from mars.SAPserv.Hamburg.dsh.de (root@mail5.hh1.dsh.de [53.2.168.17])
|
|
by mail1.hh1.dsh.de (8.8.7/8.8.7) with SMTP id QAA22248;
|
|
Wed, 14 Oct 1998 16:17:06 +0200
|
|
Received: from orion.SAPserv.Hamburg.dsh.de
|
|
by mars.SAPserv.Hamburg.dsh.de with smtp
|
|
for <<hackers@postgreSQL.org>>
|
|
id m0zTPJb-000B5AC; Wed, 14 Oct 98 13:42 MET DST
|
|
Received: by orion.SAPserv.Hamburg.dsh.de
|
|
for hackers@postgreSQL.org
|
|
id m0zTRrE-000EBRC; Wed, 14 Oct 98 16:24 MET DST
|
|
Message-Id: <m0zTRrE-000EBRC@orion.SAPserv.Hamburg.dsh.de>
|
|
From: jwieck@debis.com (Jan Wieck)
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
To: oleg@sai.msu.su
|
|
Date: Wed, 14 Oct 1998 16:24:56 +0200 (MET DST)
|
|
Cc: hackers@postgreSQL.org, t-ishii@sra.co.jp
|
|
Reply-To: jwieck@debis.com (Jan Wieck)
|
|
In-Reply-To: <Pine.GSO.3.96.SK.981014163020.10948B-100000@ra> from "Oleg Bartunov" at Oct 14, 98 04:53:53 pm
|
|
X-Mailer: ELM [version 2.4 PL25]
|
|
Content-Type: text
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: RO
|
|
|
|
Oleg Bartunov wrote:
|
|
> This is not a problem for CGI-script to know which key to start.
|
|
|
|
Never meant that would be a problem. A FORM variable will of
|
|
course do this.
|
|
|
|
> Without LIMIT every CGI call backend will do *FULL* selection
|
|
> and cursor helps just in fetching a definite number of rows,
|
|
> in principle I can do this with CGI-script. Also, cursor
|
|
> returns data back in ASCII format (man l declare) and this requires
|
|
> additional job for backend to convert data from intrinsic (binary)
|
|
> format. Right implementation of LIMIT offset,number_of_rows could be
|
|
> a great win and make postgres superior free database engine for
|
|
> Web applications. Many colleagues of mine used mysql instead of
|
|
|
|
That's the point I was missing. The offset!
|
|
|
|
> postgres just because of lacking LIMIT. Tatsuo posted a patch
|
|
> for set query_limit to 'num', I just tested it and seems it
|
|
> works fine. Now, we need only possibility to specify offset,
|
|
> say
|
|
> set query_limit to 'offset,num'
|
|
> ( Tatsuo, How difficult to do this ?)
|
|
> and LIMIT problem will ne gone.
|
|
|
|
Think you haven't read my posting completely. Even with the
|
|
executor limit, the complete scan into the sort is done by
|
|
the backend. You need to specify ORDER BY to get the same
|
|
list again (without the offset doesn't make sense). But
|
|
currently, ORDER BY forces a sort node into the query plan.
|
|
|
|
What the executor limit tells is how many rows will be
|
|
returned from the sorted data. Not what goes into the sort.
|
|
Filling the sort and sorting the data consumes the most time
|
|
of the queries execution.
|
|
|
|
I haven't looked at Tatsuo's patch very well. But if it
|
|
limits the amount of data going into the sort (on ORDER BY),
|
|
it will break it! The requested ordering could be different
|
|
from what the choosen index might return. The used index is
|
|
choosen by the planner upon the qualifications given, not the
|
|
ordering wanted.
|
|
|
|
So if you select WHERE b = 1 ORDER BY a, then it will use an
|
|
index on attribute b to match the qualification. The complete
|
|
result of that index scan goes into the sort to get ordered
|
|
by a. If now the executor limit stops sort filling after the
|
|
limit is exceeded, only the same tuples will go into the sort
|
|
every time. But they have nothing to do with the requested
|
|
order by a.
|
|
|
|
What LIMIT first needs is a planner enhancement. In file
|
|
backend/optimizer/plan/planner.c line 284 it must be checked
|
|
if the actual plan is an indexscan, if the indexed attributes
|
|
are all the same as those in the given sort clause and that
|
|
the requested sort order (operator) is that what the index
|
|
will return. If that all matches, it can ignore the sort
|
|
clause and return the index scan itself.
|
|
|
|
Second enhancement must be the handling of the offset. In
|
|
the executor, the index scan must skip offset index tuples
|
|
before returning the first. But NOT if the plan isn't a
|
|
1-table-index-scan. In that case the result tuples (from the
|
|
topmost unique/join/whatever node) have to be skipped.
|
|
|
|
With these enhancements, the index tuples to be skipped
|
|
(offset) will still be scanned, but not the data tuples they
|
|
point to. Index scanning might be somewhat faster.
|
|
|
|
This all will only speedup simple 1-table-queries, no joins
|
|
or if the requested order isn't that what the index exactly
|
|
returns.
|
|
|
|
Anyway, I'll take a look if I can change the planner to omit
|
|
the sort if the tests described above are true. I think it
|
|
would be good anyway.
|
|
|
|
|
|
Jan
|
|
|
|
--
|
|
|
|
#======================================================================#
|
|
# It's easier to get forgiveness for being wrong than for being right. #
|
|
# Let's break this rule - forgive me. #
|
|
#======================================== jwieck@debis.com (Jan Wieck) #
|
|
|
|
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Wed Oct 14 11:01:36 1998
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id LAA25489
|
|
for <maillist@candle.pha.pa.us>; Wed, 14 Oct 1998 11:01:34 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id KAA24286;
|
|
Wed, 14 Oct 1998 10:30:14 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Wed, 14 Oct 1998 10:26:34 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id KAA23732
|
|
for pgsql-hackers-outgoing; Wed, 14 Oct 1998 10:26:27 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from dsh.de (neptun.sns-felb.debis.de [53.122.101.2])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id KAA23717
|
|
for <hackers@postgreSQL.org>; Wed, 14 Oct 1998 10:26:13 -0400 (EDT)
|
|
(envelope-from wieck@sapserv.debis.de)
|
|
Received: by dsh.de; id QAA25644; Wed, 14 Oct 1998 16:28:01 +0200 (MET DST)
|
|
Received: from dshmail.dsh.de(53.47.15.3) by neptun.dsh.de via smap (3.2)
|
|
id xma025301; Wed, 14 Oct 98 16:27:43 +0200
|
|
Received: from mail1.hh1.dsh.de (mail1.hh1.dsh.de [53.47.9.5])
|
|
by dshmail.dsh.de (8.8.7/8.8.7) with ESMTP id QAA05943;
|
|
Wed, 14 Oct 1998 16:24:42 +0200 (MET DST)
|
|
Received: from mars.SAPserv.Hamburg.dsh.de (root@mail5.hh1.dsh.de [53.2.168.17])
|
|
by mail1.hh1.dsh.de (8.8.7/8.8.7) with SMTP id QAA22339;
|
|
Wed, 14 Oct 1998 16:26:57 +0200
|
|
Received: from orion.SAPserv.Hamburg.dsh.de
|
|
by mars.SAPserv.Hamburg.dsh.de with smtp
|
|
for <<lockhart@alumni.caltech.edu>>
|
|
id m0zTPT8-000B5AC; Wed, 14 Oct 98 13:51 MET DST
|
|
Received: by orion.SAPserv.Hamburg.dsh.de
|
|
for lockhart@alumni.caltech.edu
|
|
id m0zTS0m-000EBRC; Wed, 14 Oct 98 16:34 MET DST
|
|
Message-Id: <m0zTS0m-000EBRC@orion.SAPserv.Hamburg.dsh.de>
|
|
From: jwieck@debis.com (Jan Wieck)
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
To: lockhart@alumni.caltech.edu (Thomas G. Lockhart)
|
|
Date: Wed, 14 Oct 1998 16:34:47 +0200 (MET DST)
|
|
Cc: jwieck@debis.com, eric@linux-hw.com, jeff@remapcorp.com,
|
|
hackers@postgreSQL.org
|
|
Reply-To: jwieck@debis.com (Jan Wieck)
|
|
In-Reply-To: <3624AE5C.752E4E7F@alumni.caltech.edu> from "Thomas G. Lockhart" at Oct 14, 98 01:59:56 pm
|
|
X-Mailer: ELM [version 2.4 PL25]
|
|
Content-Type: text
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: RO
|
|
|
|
>
|
|
> > SELECT * FROM tab WHERE key > 'G' ORDER BY key;
|
|
> > results in a sort->indexscan - hmmm.
|
|
> > The last one is the query we would need in the web
|
|
> > environment used over a cursor as in the example above. But
|
|
> > due to the sort, the backend selects until the end of the
|
|
> > table, sorts them and then returns only the first 20 rows
|
|
> > (out of sorts result).
|
|
>
|
|
> So you are saying that for this last case the sort was unnecessary? Does
|
|
> the backend traverse the index in the correct order to guarantee that
|
|
> the tuples are coming out already sorted? Does a hash index give the
|
|
> same plan (I would expect a sort->seqscan for a hash index)?
|
|
|
|
Good point! As far as I can see, the planner chooses index
|
|
usage only depending on the WHERE clause. A hash index is
|
|
only usable when the given qualification uses = on the
|
|
indexed attribute(s).
|
|
|
|
If the sortClause exactly matches the indexed attributes of
|
|
the ONE used btree index and all operators request ascending
|
|
order I think the index scan already returns the correct
|
|
order. Who know's definitely?
|
|
|
|
Addition to my last posting: ... and if the index scan is
|
|
using a btree index ...
|
|
|
|
|
|
Jan
|
|
|
|
--
|
|
|
|
#======================================================================#
|
|
# It's easier to get forgiveness for being wrong than for being right. #
|
|
# Let's break this rule - forgive me. #
|
|
#======================================== jwieck@debis.com (Jan Wieck) #
|
|
|
|
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Wed Oct 14 13:55:58 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id NAA29300
|
|
for <maillist@candle.pha.pa.us>; Wed, 14 Oct 1998 13:55:56 -0400 (EDT)
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id NAA14245 for <maillist@candle.pha.pa.us>; Wed, 14 Oct 1998 13:49:19 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id NAA13110;
|
|
Wed, 14 Oct 1998 13:25:55 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Wed, 14 Oct 1998 13:22:14 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id NAA12694
|
|
for pgsql-hackers-outgoing; Wed, 14 Oct 1998 13:22:13 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from candle.pha.pa.us (maillist@s5-03.ppp.op.net [209.152.195.67])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id NAA12677
|
|
for <hackers@postgreSQL.org>; Wed, 14 Oct 1998 13:22:05 -0400 (EDT)
|
|
(envelope-from maillist@candle.pha.pa.us)
|
|
Received: (from maillist@localhost)
|
|
by candle.pha.pa.us (8.9.0/8.9.0) id NAA28746;
|
|
Wed, 14 Oct 1998 13:21:15 -0400 (EDT)
|
|
From: Bruce Momjian <maillist@candle.pha.pa.us>
|
|
Message-Id: <199810141721.NAA28746@candle.pha.pa.us>
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
In-Reply-To: <3624AE5C.752E4E7F@alumni.caltech.edu> from "Thomas G. Lockhart" at "Oct 14, 1998 1:59:56 pm"
|
|
To: lockhart@alumni.caltech.edu (Thomas G. Lockhart)
|
|
Date: Wed, 14 Oct 1998 13:21:15 -0400 (EDT)
|
|
Cc: jwieck@debis.com, eric@linux-hw.com, jeff@remapcorp.com,
|
|
hackers@postgreSQL.org
|
|
X-Mailer: ELM [version 2.4ME+ PL47 (25)]
|
|
MIME-Version: 1.0
|
|
Content-Type: text/plain; charset=US-ASCII
|
|
Content-Transfer-Encoding: 7bit
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: ROr
|
|
|
|
> > I've done some tests and what I found out might be a bug in
|
|
> > PostgreSQL's query optimizer.
|
|
> > SELECT * FROM tab ORDER BY key;
|
|
> > results in a sort->seqscan - I would have
|
|
> > expected an indexscan!
|
|
>
|
|
> Given that a table _could_ be completely unsorted on disk, it is
|
|
> probably reasonable to suck the data in for a possible in-memory sort
|
|
> rather than skipping around the disk to pick up individual tuples via
|
|
> the index. Don't know if vacuum has a statistic on "orderness"...
|
|
|
|
Thomas is correct on this. Vadim has run some tests, and with our
|
|
optimized psort() code, the in-memory sort is often faster than using
|
|
the index to get the tuple, because you are jumping all over the drive.
|
|
I don't remember, but obviously there is a break-even point where
|
|
getting X rows using the index on a table of Y rows is faster , but
|
|
getting X+1 rows on a table of Y rows is faster getting all the rows
|
|
sequentailly, and doing the sort.
|
|
|
|
You would have to pick only certain queries(no joins, index matches
|
|
ORDER BY), take the number of rows requested, and the number of rows
|
|
selected, and figure out if it is faster to use the index, or a
|
|
sequential scan and do the ORDER BY yourself.
|
|
|
|
|
|
Add to this the OFFSET capability. I am not sure how you are going to
|
|
get into the index and start at the n-th entry, unless perhaps you just
|
|
sequential scan the index.
|
|
|
|
In fact, many queries just get column already indexed, and we could just
|
|
pull the data right out of the index.
|
|
|
|
I have added this to the TODO list:
|
|
|
|
* Pull requested data directly from indexes, bypassing heap data
|
|
|
|
I think this has to be post-6.4 work, but I think we need to work in
|
|
this direction. I am holding off any cnfify fixes for post-6.4, but a
|
|
6.4.1 performance release certainly is possible.
|
|
|
|
|
|
But, you are correct that certain cases where in index is already being
|
|
used on a query, you could just skip the sort IF you used the index to
|
|
get the rows from the base table.
|
|
|
|
--
|
|
Bruce Momjian | http://www.op.net/~candle
|
|
maillist@candle.pha.pa.us | (610) 853-3000
|
|
+ If your life is a hard drive, | 830 Blythe Avenue
|
|
+ Christ can be your backup. | Drexel Hill, Pennsylvania 19026
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Wed Oct 14 13:55:59 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id NAA29303
|
|
for <maillist@candle.pha.pa.us>; Wed, 14 Oct 1998 13:55:58 -0400 (EDT)
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id NAA13463 for <maillist@candle.pha.pa.us>; Wed, 14 Oct 1998 13:39:05 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id NAA11655;
|
|
Wed, 14 Oct 1998 13:13:32 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Wed, 14 Oct 1998 13:09:41 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id NAA11013
|
|
for pgsql-hackers-outgoing; Wed, 14 Oct 1998 13:09:39 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from terry1.acun.com (terry@terry1.acun.com [206.27.86.12])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id NAA10997
|
|
for <hackers@postgreSQL.org>; Wed, 14 Oct 1998 13:09:30 -0400 (EDT)
|
|
(envelope-from terry@terrym.com)
|
|
Received: from localhost (terry@localhost)
|
|
by terry1.acun.com (8.8.5/8.8.5) with SMTP id NAA14478;
|
|
Wed, 14 Oct 1998 13:21:51 -0400
|
|
Date: Wed, 14 Oct 1998 13:21:51 -0400 (EDT)
|
|
From: Terry Mackintosh <terry@terrym.com>
|
|
X-Sender: terry@terry1.acun.com
|
|
To: Jeff Hoffmann <jeff@remapcorp.com>
|
|
cc: PostgreSQL-development <hackers@postgreSQL.org>
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
In-Reply-To: <005101bdf6de$f9345150$c525c4ce@go-to-jail.remapcorp.com>
|
|
Message-ID: <Pine.LNX.3.95.981014130857.14397B-100000@terry1.acun.com>
|
|
MIME-Version: 1.0
|
|
Content-Type: TEXT/PLAIN; charset=US-ASCII
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: RO
|
|
|
|
On Tue, 13 Oct 1998, Jeff Hoffmann wrote:
|
|
|
|
> >Hi, my 2 cents...
|
|
> >
|
|
> >I agree completely, LIMIT would be VERY usefull in web based apps, which
|
|
> >is all I run. It does not matter to me if it is not part of a formal
|
|
> >standard. The idea is so common that it is a defacto standard.
|
|
>
|
|
> i'm not familiar with mysql and using "LIMIT" but wouldn't this same effect
|
|
> be achieved by declaring a cursor and fetching however many records in the
|
|
> cursor? it's a very noticeable improvement when you only want the first 20
|
|
> out of 500 in a 200k record database, at least.
|
|
|
|
Yes, while this is an improvement, it still has to do the entire query,
|
|
would be nice if the query could be terminated after a designated number
|
|
of rows where found, thus freeing system resources that are other wise
|
|
consumed.
|
|
I have seen web users run ridculous querys, like search for the
|
|
letter 'a', and it happens to be a substring search, now the box go'es ape
|
|
shit for 5 or 10 min.s while it basically gets the whole db as the search
|
|
result. All this befor you can do a 'FETCH', as I understand FETCH, I
|
|
will need to read up on it.
|
|
|
|
Note that I do not have any databases that larg on my box, I was thinking
|
|
back to my VFP/NT experiances.
|
|
|
|
Have a great day
|
|
Terry Mackintosh <terry@terrym.com> http://www.terrym.com
|
|
sysadmin/owner Please! No MIME encoded or HTML mail, unless needed.
|
|
|
|
Proudly powered by R H Linux 4.2, Apache 1.3, PHP 3, PostgreSQL 6.3
|
|
-------------------------------------------------------------------
|
|
Success Is A Choice ... book by Rick Patino, get it, read it!
|
|
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Wed Oct 14 13:59:05 1998
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id NAA29345
|
|
for <maillist@candle.pha.pa.us>; Wed, 14 Oct 1998 13:58:59 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id NAA14021;
|
|
Wed, 14 Oct 1998 13:32:51 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Wed, 14 Oct 1998 13:29:09 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id NAA13364
|
|
for pgsql-hackers-outgoing; Wed, 14 Oct 1998 13:29:07 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from terry1.acun.com (terry@terry1.acun.com [206.27.86.12])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id NAA13328
|
|
for <hackers@postgreSQL.org>; Wed, 14 Oct 1998 13:28:56 -0400 (EDT)
|
|
(envelope-from terry@terrym.com)
|
|
Received: from localhost (terry@localhost)
|
|
by terry1.acun.com (8.8.5/8.8.5) with SMTP id NAA14606
|
|
for <hackers@postgreSQL.org>; Wed, 14 Oct 1998 13:41:25 -0400
|
|
Date: Wed, 14 Oct 1998 13:41:24 -0400 (EDT)
|
|
From: Terry Mackintosh <terry@terrym.com>
|
|
X-Sender: terry@terry1.acun.com
|
|
To: PostgreSQL-development <hackers@postgreSQL.org>
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
In-Reply-To: <199810132116.RAA11249@candle.pha.pa.us>
|
|
Message-ID: <Pine.LNX.3.95.981014133641.14397D-100000@terry1.acun.com>
|
|
MIME-Version: 1.0
|
|
Content-Type: TEXT/PLAIN; charset=US-ASCII
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: RO
|
|
|
|
On Tue, 13 Oct 1998, Bruce Momjian wrote:
|
|
|
|
> What we could do is _if_ there is only one table(no joins), and an index
|
|
> exists that matches the ORDER BY, we could use the index to
|
|
> short-circuit the query.
|
|
>
|
|
> I have added this item to the TODO list:
|
|
>
|
|
> * Allow LIMIT ability on single-table queries that have no ORDER BY or
|
|
> a matching index
|
|
>
|
|
> This looks do-able, and a real win. Would this make web applications
|
|
> happier? If there is an ORDER BY and no index, or a join, I can't
|
|
> figure out how we would short-circuit the query.
|
|
>
|
|
Yes, this would do for most of my apps.
|
|
It may just be my lack of sophistication, but I find that most web apps
|
|
are very simple in nature/table layout, and thus queries are often on only
|
|
a single table.
|
|
|
|
Thanks
|
|
Terry Mackintosh <terry@terrym.com> http://www.terrym.com
|
|
sysadmin/owner Please! No MIME encoded or HTML mail, unless needed.
|
|
|
|
Proudly powered by R H Linux 4.2, Apache 1.3, PHP 3, PostgreSQL 6.3
|
|
-------------------------------------------------------------------
|
|
Success Is A Choice ... book by Rick Patino, get it, read it!
|
|
|
|
|
|
|
|
From wieck@sapserv.debis.de Wed Oct 14 13:55:53 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id NAA29290
|
|
for <maillist@candle.pha.pa.us>; Wed, 14 Oct 1998 13:55:51 -0400 (EDT)
|
|
Received: from dsh.de (firewall-user@neptun.sns-felb.debis.de [53.122.101.2]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id NAA14370 for <maillist@candle.pha.pa.us>; Wed, 14 Oct 1998 13:51:19 -0400 (EDT)
|
|
Received: by dsh.de; id TAA03418; Wed, 14 Oct 1998 19:50:18 +0200 (MET DST)
|
|
Received: from dshmail.dsh.de(53.47.15.3) by neptun.dsh.de via smap (3.2)
|
|
id xma003369; Wed, 14 Oct 98 19:49:51 +0200
|
|
Received: from mail1.hh1.dsh.de (mail1.hh1.dsh.de [53.47.9.5])
|
|
by dshmail.dsh.de (8.8.7/8.8.7) with ESMTP id TAA16746;
|
|
Wed, 14 Oct 1998 19:47:14 +0200 (MET DST)
|
|
Received: from mars.SAPserv.Hamburg.dsh.de (root@mail5.hh1.dsh.de [53.2.168.17])
|
|
by mail1.hh1.dsh.de (8.8.7/8.8.7) with SMTP id TAA23570;
|
|
Wed, 14 Oct 1998 19:49:32 +0200
|
|
Received: from orion.SAPserv.Hamburg.dsh.de
|
|
by mars.SAPserv.Hamburg.dsh.de with smtp
|
|
for <<lockhart@alumni.caltech.edu>>
|
|
id m0zTSdF-000B5AC; Wed, 14 Oct 98 17:14 MET DST
|
|
Received: by orion.SAPserv.Hamburg.dsh.de
|
|
for lockhart@alumni.caltech.edu
|
|
id m0zTVAt-000EBRC; Wed, 14 Oct 98 19:57 MET DST
|
|
Message-Id: <m0zTVAt-000EBRC@orion.SAPserv.Hamburg.dsh.de>
|
|
From: jwieck@debis.com (Jan Wieck)
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
To: maillist@candle.pha.pa.us (Bruce Momjian)
|
|
Date: Wed, 14 Oct 1998 19:57:27 +0200 (MET DST)
|
|
Cc: lockhart@alumni.caltech.edu, jwieck@debis.com, eric@linux-hw.com,
|
|
jeff@remapcorp.com, hackers@postgreSQL.org
|
|
Reply-To: jwieck@debis.com (Jan Wieck)
|
|
In-Reply-To: <199810141721.NAA28746@candle.pha.pa.us> from "Bruce Momjian" at Oct 14, 98 01:21:15 pm
|
|
X-Mailer: ELM [version 2.4 PL25]
|
|
Content-Type: text
|
|
Status: RO
|
|
|
|
> But, you are correct that certain cases where in index is already being
|
|
> used on a query, you could just skip the sort IF you used the index to
|
|
> get the rows from the base table.
|
|
|
|
Especially in the case where
|
|
|
|
SELECT ... WHERE key > 'val' ORDER BY key;
|
|
|
|
creates a Sort->IndexScan plan. The index scan already jumps
|
|
around on the disc to collect the sorts input and the sort
|
|
finally returns exactly the same output (if the used index is
|
|
only on key).
|
|
|
|
And this is the case for large tables. The planner first
|
|
decides to use an index scan due to the WHERE clause and
|
|
later it notices the ORDER BY clause and creates a sort over
|
|
the scan.
|
|
|
|
I'm actually hacking around on it to see what happens if I
|
|
suppress the sort node in some cases.
|
|
|
|
|
|
Jan
|
|
|
|
--
|
|
|
|
#======================================================================#
|
|
# It's easier to get forgiveness for being wrong than for being right. #
|
|
# Let's break this rule - forgive me. #
|
|
#======================================== jwieck@debis.com (Jan Wieck) #
|
|
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Wed Oct 14 16:31:07 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id QAA01119
|
|
for <maillist@candle.pha.pa.us>; Wed, 14 Oct 1998 16:31:05 -0400 (EDT)
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id PAA22534 for <maillist@candle.pha.pa.us>; Wed, 14 Oct 1998 15:29:50 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id PAA26335;
|
|
Wed, 14 Oct 1998 15:05:26 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Wed, 14 Oct 1998 15:02:13 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id PAA26013
|
|
for pgsql-hackers-outgoing; Wed, 14 Oct 1998 15:02:11 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from candle.pha.pa.us (root@s5-03.ppp.op.net [209.152.195.67])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id PAA25996
|
|
for <hackers@postgreSQL.org>; Wed, 14 Oct 1998 15:01:58 -0400 (EDT)
|
|
(envelope-from maillist@candle.pha.pa.us)
|
|
Received: (from maillist@localhost)
|
|
by candle.pha.pa.us (8.9.0/8.9.0) id OAA29639;
|
|
Wed, 14 Oct 1998 14:27:05 -0400 (EDT)
|
|
From: Bruce Momjian <maillist@candle.pha.pa.us>
|
|
Message-Id: <199810141827.OAA29639@candle.pha.pa.us>
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
In-Reply-To: <199810141721.NAA28746@candle.pha.pa.us> from Bruce Momjian at "Oct 14, 1998 1:21:15 pm"
|
|
To: maillist@candle.pha.pa.us (Bruce Momjian)
|
|
Date: Wed, 14 Oct 1998 14:27:05 -0400 (EDT)
|
|
Cc: lockhart@alumni.caltech.edu, jwieck@debis.com, eric@linux-hw.com,
|
|
jeff@remapcorp.com, hackers@postgreSQL.org
|
|
X-Mailer: ELM [version 2.4ME+ PL47 (25)]
|
|
MIME-Version: 1.0
|
|
Content-Type: text/plain; charset=US-ASCII
|
|
Content-Transfer-Encoding: 7bit
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: RO
|
|
|
|
> Thomas is correct on this. Vadim has run some tests, and with our
|
|
> optimized psort() code, the in-memory sort is often faster than using
|
|
> the index to get the tuple, because you are jumping all over the drive.
|
|
> I don't remember, but obviously there is a break-even point where
|
|
> getting X rows using the index on a table of Y rows is faster , but
|
|
> getting X+1 rows on a table of Y rows is faster getting all the rows
|
|
> sequentailly, and doing the sort.
|
|
>
|
|
> You would have to pick only certain queries(no joins, index matches
|
|
> ORDER BY), take the number of rows requested, and the number of rows
|
|
> selected, and figure out if it is faster to use the index, or a
|
|
> sequential scan and do the ORDER BY yourself.
|
|
>
|
|
> Add to this the OFFSET capability. I am not sure how you are going to
|
|
> get into the index and start at the n-th entry, unless perhaps you just
|
|
> sequential scan the index.
|
|
>
|
|
> In fact, many queries just get column already indexed, and we could just
|
|
> pull the data right out of the index.
|
|
>
|
|
> I have added this to the TODO list:
|
|
>
|
|
> * Pull requested data directly from indexes, bypassing heap data
|
|
>
|
|
> I think this has to be post-6.4 work, but I think we need to work in
|
|
> this direction. I am holding off any cnfify fixes for post-6.4, but a
|
|
> 6.4.1 performance release certainly is possible.
|
|
>
|
|
>
|
|
> But, you are correct that certain cases where in index is already being
|
|
> used on a query, you could just skip the sort IF you used the index to
|
|
> get the rows from the base table.
|
|
|
|
I have had more time to think about this. Basically, for pre-sorted
|
|
data, our psort code is very fast, because it does not need to sort
|
|
anything. It just moves the rows in and out of the sort memory. Yes,
|
|
it could be removed in some cases, and probably should be, but it is not
|
|
going to produce great speedups.
|
|
|
|
The more general case I will describe below.
|
|
|
|
First, let's look at a normal query:
|
|
|
|
SELECT *
|
|
FROM tab
|
|
ORDER BY col1
|
|
|
|
This is not going to use an index, and probably should not because it is
|
|
faster for large tables to sort them in memory, rather than moving all
|
|
over the disk. For small tables, if the entire table fits in the buffer
|
|
cache, it may be faster to use the index, but on a small table the sort
|
|
doesn't take very long either, and the buffer cache effectiveness is
|
|
affected by other backends using it, so it may be better not to count on
|
|
it for a speedup.
|
|
|
|
However, if you only want the first 10 rows, that is a different story.
|
|
We pull all the rows into the backend, sort them, then return 10 rows.
|
|
The query, if we could do it, should be written as:
|
|
|
|
SELECT *
|
|
FROM tab
|
|
WHERE col1 < some_unknown_value
|
|
ORDER BY col1
|
|
|
|
In this case, the optimizer looks at the column statistics, and properly
|
|
uses an index to pull only a small subset of the table. This is the
|
|
type of behavior people want for queries returning only a few values.
|
|
|
|
But, unfortunately, we don't know that mystery value.
|
|
|
|
Now, everyone agrees we need an index matching the ORDER BY to make this
|
|
query quick, but we don't know that mystery value, so currently we
|
|
execute the whole query, and do a fetch.
|
|
|
|
What I am now thinking is that maybe we need a way to walk around that
|
|
index. Someone months ago asked how to do that, and we told him he
|
|
couldn't, because this not a C-ISAM/dbm type database. However, if we
|
|
could somehow pass into the query the index location we want to start
|
|
at, and how many rows we need, that would solve our problem, and perhaps
|
|
even allow joined queries to work, assuming the table in the ORDER BY is
|
|
in an outer join loop.
|
|
|
|
SELECT *
|
|
FROM tab
|
|
WHERE col1 < some_unknown_value
|
|
ORDER BY col1
|
|
USING INDEX tab_idx(452) COUNT 100
|
|
|
|
where 452 is an 452th index entry, and COUNT is the number of index rows
|
|
you want to process. The query may return more or less than 100 rows if
|
|
there is a join and it joins to zero or more than one row in the joined
|
|
table, but this seems like perhaps a good way to go at it. We need to
|
|
do it this way because if a single index row returns 4 result rows, and
|
|
only two of the four rows fit in the number of rows returnd as set by the
|
|
user, it is hard to re-start the query at the proper point, because you
|
|
would have to process the index rows a second time, and return just part
|
|
of the result, and that is hard.
|
|
|
|
If the index changes, or rows are added, the results are going to be
|
|
unreliable, but that is probably going to be true of any state-less
|
|
implementation we can devise.
|
|
|
|
I think this may be fairly easy to implement. We could sequential scan
|
|
the index to get to the 452th row. That is going to be quick. We can
|
|
pass the 452 into the btree index code, so only a certain range of index
|
|
tuples are returned, and the system believes it has processed the entire
|
|
query, while we know it hasn't. Doesn't really work with hash, so we
|
|
will not allow it for those indexes.
|
|
|
|
To make it really easy, we could implement it as a 'SET' command, so we
|
|
don't actually have it as part of the query, and have to pass it around
|
|
through all the modules. You would do the proper 'SET' before running
|
|
the query. Optimizer would look at 'SET' value to force index use.
|
|
|
|
SET INDEX TO tab_idx START 452 COUNT 100
|
|
|
|
or
|
|
|
|
SET INDEX TO tab_idx FROM 452 COUNT 451
|
|
|
|
There would have to be some way to signal that the end of the index had
|
|
been reached, because returning zero rows is not enough of a guarantee
|
|
in a joined SELECT.
|
|
|
|
Comments?
|
|
|
|
--
|
|
Bruce Momjian | http://www.op.net/~candle
|
|
maillist@candle.pha.pa.us | (610) 853-3000
|
|
+ If your life is a hard drive, | 830 Blythe Avenue
|
|
+ Christ can be your backup. | Drexel Hill, Pennsylvania 19026
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Wed Oct 14 17:31:23 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id RAA01591
|
|
for <maillist@candle.pha.pa.us>; Wed, 14 Oct 1998 17:31:21 -0400 (EDT)
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id RAA02744 for <maillist@candle.pha.pa.us>; Wed, 14 Oct 1998 17:26:53 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id RAA05601;
|
|
Wed, 14 Oct 1998 17:03:21 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Wed, 14 Oct 1998 16:59:54 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id QAA04964
|
|
for pgsql-hackers-outgoing; Wed, 14 Oct 1998 16:59:52 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from dsh.de (firewall-user@neptun.sns-felb.debis.de [53.122.101.2])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id QAA04943
|
|
for <hackers@postgreSQL.org>; Wed, 14 Oct 1998 16:59:28 -0400 (EDT)
|
|
(envelope-from wieck@sapserv.debis.de)
|
|
Received: by dsh.de; id WAA28383; Wed, 14 Oct 1998 22:57:42 +0200 (MET DST)
|
|
Received: from dshmail.dsh.de(53.47.15.3) by neptun.dsh.de via smap (3.2)
|
|
id xma028354; Wed, 14 Oct 98 22:57:28 +0200
|
|
Received: from mail1.hh1.dsh.de (mail1.hh1.dsh.de [53.47.9.5])
|
|
by dshmail.dsh.de (8.8.7/8.8.7) with ESMTP id WAA20547;
|
|
Wed, 14 Oct 1998 22:54:51 +0200 (MET DST)
|
|
Received: from mars.SAPserv.Hamburg.dsh.de (root@mail5.hh1.dsh.de [53.2.168.17])
|
|
by mail1.hh1.dsh.de (8.8.7/8.8.7) with SMTP id WAA24383;
|
|
Wed, 14 Oct 1998 22:57:09 +0200
|
|
Received: from orion.SAPserv.Hamburg.dsh.de
|
|
by mars.SAPserv.Hamburg.dsh.de with smtp
|
|
for <<lockhart@alumni.caltech.edu>>
|
|
id m0zTVYr-000B5AC; Wed, 14 Oct 98 20:22 MET DST
|
|
Received: by orion.SAPserv.Hamburg.dsh.de
|
|
for lockhart@alumni.caltech.edu
|
|
id m0zTY6V-000EBRC; Wed, 14 Oct 98 23:05 MET DST
|
|
Message-Id: <m0zTY6V-000EBRC@orion.SAPserv.Hamburg.dsh.de>
|
|
From: jwieck@debis.com (Jan Wieck)
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
To: maillist@candle.pha.pa.us (Bruce Momjian)
|
|
Date: Wed, 14 Oct 1998 23:05:07 +0200 (MET DST)
|
|
Cc: maillist@candle.pha.pa.us, lockhart@alumni.caltech.edu, jwieck@debis.com,
|
|
eric@linux-hw.com, jeff@remapcorp.com, hackers@postgreSQL.org
|
|
Reply-To: jwieck@debis.com (Jan Wieck)
|
|
In-Reply-To: <199810141827.OAA29639@candle.pha.pa.us> from "Bruce Momjian" at Oct 14, 98 02:27:05 pm
|
|
X-Mailer: ELM [version 2.4 PL25]
|
|
Content-Type: text
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: ROr
|
|
|
|
> I have had more time to think about this. Basically, for pre-sorted
|
|
> data, our psort code is very fast, because it does not need to sort
|
|
> anything. It just moves the rows in and out of the sort memory. Yes,
|
|
> it could be removed in some cases, and probably should be, but it is not
|
|
> going to produce great speedups.
|
|
|
|
And I got the time to hack around about this.
|
|
|
|
I hacked in a little check into the planner, that compares
|
|
the sortClause against the key field list of an index scan
|
|
and just suppresses the sort node if it exactly matchs and
|
|
all sort operators are "<".
|
|
|
|
I tested with a 10k row table where key is a text field. The
|
|
base query is a
|
|
|
|
SELECT ... WHERE key > 'val' ORDER BY key;
|
|
|
|
The used 'val' is always a key that is close to the first of
|
|
all keys in the table ('' on the first query and the last
|
|
selected value on subsequent ones).
|
|
|
|
Scenario 1 (S1) uses exactly the above query but processes
|
|
only the first 20 rows from the result buffer. Thus the
|
|
frontend receives nearly the whole table.
|
|
|
|
Scenario 2 (S2) uses a cursor and FETCH 20. But closes the
|
|
cursor and creates a new one for the next selection (only
|
|
with another 'val') as it would occur in a web application.
|
|
|
|
If there is no index on key, the backend will allways do a
|
|
Sort->SeqScan and due to the 'val' close to the lowest
|
|
existing key nearly all tuples get scanned and put into the
|
|
sort. S1 here runs about 10 seconds and S2 about 6 seconds.
|
|
The speedup in S2 results from the reduced overhead of
|
|
sending not wanted tuples into the frontend.
|
|
|
|
Now with a btree index on key and an unpatched backend.
|
|
Produced plan is always a Sort->IndexScan. S1 needs 16
|
|
seconds and S2 needs 12 seconds. Again nearly all data is put
|
|
into the sort but this time over the index scan and that is
|
|
slower.
|
|
|
|
Last with the btree index on key and the patched backend.
|
|
This time the plan is a plain IndexScan because the ORDER BY
|
|
clause exactly matches the sort order of the choosen index.
|
|
S1 needs 13 seconds and S2 less than 0.2! This dramatic
|
|
speedup comes from the fact, that this time the index scan is
|
|
the toplevel executor node and the executor run is stopped
|
|
after 20 tuples have been selected.
|
|
|
|
Analysis of the above timings:
|
|
|
|
If there is an ORDER BY clause, using an index scan is the
|
|
clever way if the indexqual dramatically reduces the the
|
|
amount of data selected and sorted. I think this is the
|
|
normal case (who really selects nearly all rows from a 5M row
|
|
table?). So choosing the index path is correct. This will
|
|
hurt if someone really selects most of the rows and the index
|
|
scan jumps over the disc. But here the programmer should use
|
|
an unqualified query to perform a seqscan and do the
|
|
qualification in the frontend application.
|
|
|
|
The speedup for the cursor/fetch scenario is so impressive
|
|
that I'll create a post 6.4 patch. I don't want it in 6.4
|
|
because there is absolutely no query in the whole regression
|
|
test, where it suppresses the sort node. So we have
|
|
absolutely no check that it doesn't break anything.
|
|
|
|
For a web application, that can use a unique key to select
|
|
the next amount of rows, it will be a big win.
|
|
|
|
|
|
Jan
|
|
|
|
--
|
|
|
|
#======================================================================#
|
|
# It's easier to get forgiveness for being wrong than for being right. #
|
|
# Let's break this rule - forgive me. #
|
|
#======================================== jwieck@debis.com (Jan Wieck) #
|
|
|
|
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Thu Oct 15 00:01:10 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id AAA06040
|
|
for <maillist@candle.pha.pa.us>; Thu, 15 Oct 1998 00:01:04 -0400 (EDT)
|
|
Received: from hub.org (root@hub.org [209.47.148.200]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id XAA29020 for <maillist@candle.pha.pa.us>; Wed, 14 Oct 1998 23:57:58 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id WAA02215;
|
|
Wed, 14 Oct 1998 22:39:07 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Wed, 14 Oct 1998 22:35:19 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id WAA02061
|
|
for pgsql-hackers-outgoing; Wed, 14 Oct 1998 22:35:16 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from sraigw.sra.co.jp (sraigw.sra.co.jp [202.32.10.2])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id WAA01851
|
|
for <hackers@postgreSQL.org>; Wed, 14 Oct 1998 22:35:01 -0400 (EDT)
|
|
(envelope-from t-ishii@srapc451.sra.co.jp)
|
|
Received: from srapc451.sra.co.jp (srapc451 [133.137.44.37])
|
|
by sraigw.sra.co.jp (8.8.7/3.6Wbeta7-sraigw) with ESMTP id LAA17765;
|
|
Thu, 15 Oct 1998 11:34:39 +0900 (JST)
|
|
Received: from srapc451.sra.co.jp (localhost [127.0.0.1]) by srapc451.sra.co.jp (8.8.8/3.5Wpl7) with ESMTP id LAA08260; Thu, 15 Oct 1998 11:34:54 +0900 (JST)
|
|
Message-Id: <199810150234.LAA08260@srapc451.sra.co.jp>
|
|
To: jwieck@debis.com (Jan Wieck)
|
|
cc: oleg@sai.msu.su, hackers@postgreSQL.org, t-ishii@sra.co.jp
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
From: Tatsuo Ishii <t-ishii@sra.co.jp>
|
|
Reply-To: t-ishii@sra.co.jp
|
|
In-reply-to: Your message of Wed, 14 Oct 1998 16:24:56 +0200.
|
|
<m0zTRrE-000EBRC@orion.SAPserv.Hamburg.dsh.de>
|
|
Date: Thu, 15 Oct 1998 11:34:54 +0900
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: RO
|
|
|
|
>> postgres just because of lacking LIMIT. Tatsuo posted a patch
|
|
>> for set query_limit to 'num', I just tested it and seems it
|
|
>> works fine. Now, we need only possibility to specify offset,
|
|
>> say
|
|
>> set query_limit to 'offset,num'
|
|
>> ( Tatsuo, How difficult to do this ?)
|
|
>> and LIMIT problem will ne gone.
|
|
>
|
|
> Think you haven't read my posting completely. Even with the
|
|
> executor limit, the complete scan into the sort is done by
|
|
> the backend. You need to specify ORDER BY to get the same
|
|
> list again (without the offset doesn't make sense). But
|
|
> currently, ORDER BY forces a sort node into the query plan.
|
|
|
|
I think we have understanded your point. set query_limit is just a
|
|
easy alternative of using cursor and fetch.
|
|
|
|
> I haven't looked at Tatsuo's patch very well. But if it
|
|
> limits the amount of data going into the sort (on ORDER BY),
|
|
> it will break it! The requested ordering could be different
|
|
> from what the choosen index might return. The used index is
|
|
> choosen by the planner upon the qualifications given, not the
|
|
> ordering wanted.
|
|
|
|
I think it limits the final result. When query_limit is set,
|
|
the arg "numberTuples" of ExecutePlan() is set to it instead of 0
|
|
(this means no limit).
|
|
|
|
Talking about "offset," it shouldn't be very difficult. I guess all we
|
|
have to do is adding a new arg "offset" to ExecutePlan() then making
|
|
obvious modifications. (and of course we have to modify set
|
|
query_limit syntax but it's trivial)
|
|
|
|
However, before going ahead, I would like to ask other hackers about
|
|
this direction. This might be convenient for some users, but still the
|
|
essential performance issue would remain. In another word, this is a
|
|
short-term solution not a intrinsic one, IMHO.
|
|
--
|
|
Tatsuo Ishii
|
|
t-ishii@sra.co.jp
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Thu Oct 15 10:01:17 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id KAA13960
|
|
for <maillist@candle.pha.pa.us>; Thu, 15 Oct 1998 10:01:15 -0400 (EDT)
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id JAA20266 for <maillist@candle.pha.pa.us>; Thu, 15 Oct 1998 09:12:21 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id IAA26142;
|
|
Thu, 15 Oct 1998 08:19:49 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Thu, 15 Oct 1998 08:13:48 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id IAA25747
|
|
for pgsql-hackers-outgoing; Thu, 15 Oct 1998 08:13:46 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from dsh.de (firewall-user@neptun.sns-felb.debis.de [53.122.101.2])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id IAA25733
|
|
for <hackers@postgreSQL.org>; Thu, 15 Oct 1998 08:13:40 -0400 (EDT)
|
|
(envelope-from wieck@sapserv.debis.de)
|
|
Received: by dsh.de; id OAA18677; Thu, 15 Oct 1998 14:16:12 +0200 (MET DST)
|
|
Received: from dshmail.dsh.de(53.47.15.3) by neptun.dsh.de via smap (3.2)
|
|
id xma018279; Thu, 15 Oct 98 14:15:39 +0200
|
|
Received: from mail1.hh1.dsh.de (mail1.hh1.dsh.de [53.47.9.5])
|
|
by dshmail.dsh.de (8.8.7/8.8.7) with ESMTP id OAA01227;
|
|
Thu, 15 Oct 1998 14:13:09 +0200 (MET DST)
|
|
Received: from mars.SAPserv.Hamburg.dsh.de (root@mail5.hh1.dsh.de [53.2.168.17])
|
|
by mail1.hh1.dsh.de (8.8.7/8.8.7) with SMTP id OAA28938;
|
|
Thu, 15 Oct 1998 14:15:27 +0200
|
|
Received: from orion.SAPserv.Hamburg.dsh.de
|
|
by mars.SAPserv.Hamburg.dsh.de with smtp
|
|
for <<jwieck@debis.com>>
|
|
id m0zTjtm-000B5AC; Thu, 15 Oct 98 11:40 MET DST
|
|
Received: by orion.SAPserv.Hamburg.dsh.de
|
|
for jwieck@debis.com
|
|
id m0zTmRT-000EBRC; Thu, 15 Oct 98 14:23 MET DST
|
|
Message-Id: <m0zTmRT-000EBRC@orion.SAPserv.Hamburg.dsh.de>
|
|
From: jwieck@debis.com (Jan Wieck)
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
To: t-ishii@sra.co.jp
|
|
Date: Thu, 15 Oct 1998 14:23:43 +0200 (MET DST)
|
|
Cc: jwieck@debis.com, oleg@sai.msu.su, hackers@postgreSQL.org
|
|
Reply-To: jwieck@debis.com (Jan Wieck)
|
|
In-Reply-To: <199810150234.LAA08260@srapc451.sra.co.jp> from "Tatsuo Ishii" at Oct 15, 98 11:34:54 am
|
|
X-Mailer: ELM [version 2.4 PL25]
|
|
Content-Type: text
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: RO
|
|
|
|
Tatsuo Ishii wrote:
|
|
|
|
> I think we have understanded your point. set query_limit is just a
|
|
> easy alternative of using cursor and fetch.
|
|
>
|
|
> > I haven't looked at Tatsuo's patch very well. But if it
|
|
> > limits the amount of data going into the sort (on ORDER BY),
|
|
> > it will break it! The requested ordering could be different
|
|
> > from what the choosen index might return. The used index is
|
|
> > choosen by the planner upon the qualifications given, not the
|
|
> > ordering wanted.
|
|
>
|
|
> I think it limits the final result. When query_limit is set,
|
|
> the arg "numberTuples" of ExecutePlan() is set to it instead of 0
|
|
> (this means no limit).
|
|
>
|
|
> Talking about "offset," it shouldn't be very difficult. I guess all we
|
|
> have to do is adding a new arg "offset" to ExecutePlan() then making
|
|
> obvious modifications. (and of course we have to modify set
|
|
> query_limit syntax but it's trivial)
|
|
|
|
The offset could become
|
|
|
|
FETCH n IN cursor [OFFSET n];
|
|
|
|
and
|
|
|
|
SELECT ... [LIMIT offset,count];
|
|
|
|
The FETCH command already calls ExecutorRun() with the given
|
|
count (the tuple limit). Telling it the offset too is really
|
|
simple. And ExecutorRun() could check if the toplevel
|
|
executor node is an index scan. Skipping tuples during the
|
|
index scan requires, that all qualifications are in the
|
|
indexqual, thus any tuple returned by it will become a final
|
|
result row (as it would be in the simple 1-table-queries we
|
|
discussed). If that isn't the case, the executor must
|
|
fallback to skip the final result tuples and that is after an
|
|
eventually processed sort/merge of the complete result set.
|
|
That would only reduce communication to the client and memory
|
|
required there to buffer the result set (not a bad thing
|
|
either).
|
|
|
|
ProcessQueryDesc() in tcop/pquery.c also calls ExecutorRun()
|
|
but with a constant 0 tuple count. Having offset and count in
|
|
the parsetree would make it without any state variables or
|
|
SET command. And it's the only clean way to restrict LIMIT to
|
|
SELECT queries. Any thrown in LIMIT to ExecutorRun() from
|
|
another place could badly hurt the rewrite system. Remember
|
|
that non-instead actions on insert/update/delete are
|
|
processed before the original query! And what about SQL
|
|
functions that get processed during the evaluation of another
|
|
query (view using an SQL function for count(*))?
|
|
|
|
A little better would it be to make the LIMIT values able to
|
|
be parameter nodes. C or PL functions use the prepared plan
|
|
feature of the SPI manager for performance reasons.
|
|
Especially the offset value might there need to be a
|
|
parameter that the executor has to pick out first. If we
|
|
change the count argument of ExecutorRun to a List *limit,
|
|
this one could be NIL (to mean the old 0 count 0 offset
|
|
behaviour) or a list of two elements that both can be either
|
|
a Const or a Param of type int4. Easy for the executor to
|
|
evaluate.
|
|
|
|
The only places where ExecutorRun() is called are
|
|
tcop/pquery.c (queries from frontend), commands/command.c
|
|
(FETCH command), executor/functions.c (SQL functions) and
|
|
executor/spi.c (SPI manager). So it is easy to change the
|
|
call interface too.
|
|
|
|
|
|
Jan
|
|
|
|
--
|
|
|
|
#======================================================================#
|
|
# It's easier to get forgiveness for being wrong than for being right. #
|
|
# Let's break this rule - forgive me. #
|
|
#======================================== jwieck@debis.com (Jan Wieck) #
|
|
|
|
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Thu Oct 15 14:32:34 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id OAA19803
|
|
for <maillist@candle.pha.pa.us>; Thu, 15 Oct 1998 14:32:31 -0400 (EDT)
|
|
Received: from hub.org (root@hub.org [209.47.148.200]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id NAA10847 for <maillist@candle.pha.pa.us>; Thu, 15 Oct 1998 13:38:16 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id MAA22772;
|
|
Thu, 15 Oct 1998 12:07:20 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Thu, 15 Oct 1998 12:02:33 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id MAA22026
|
|
for pgsql-hackers-outgoing; Thu, 15 Oct 1998 12:02:31 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from ra.sai.msu.su (ra.sai.msu.su [158.250.29.2])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id MAA22007
|
|
for <hackers@postgreSQL.org>; Thu, 15 Oct 1998 12:02:16 -0400 (EDT)
|
|
(envelope-from oleg@sai.msu.su)
|
|
Received: from ra (ra [158.250.29.2])
|
|
by ra.sai.msu.su (8.9.1/8.9.1) with SMTP id TAA21024;
|
|
Thu, 15 Oct 1998 19:01:23 +0300 (MSK)
|
|
Date: Thu, 15 Oct 1998 20:01:23 +0400 (MSD)
|
|
From: Oleg Bartunov <oleg@sai.msu.su>
|
|
X-Sender: megera@ra
|
|
To: Jan Wieck <jwieck@debis.com>
|
|
cc: t-ishii@sra.co.jp, hackers@postgreSQL.org
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
In-Reply-To: <m0zTmRT-000EBRC@orion.SAPserv.Hamburg.dsh.de>
|
|
Message-ID: <Pine.GSO.3.96.SK.981015193853.19322D-100000@ra>
|
|
Organization: Sternberg Astronomical Institute (Moscow University)
|
|
MIME-Version: 1.0
|
|
Content-Type: TEXT/PLAIN; charset=US-ASCII
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: ROr
|
|
|
|
This is a little bit off-topic,
|
|
I did some timings with latest cvs on my real database
|
|
( all output redirected to /dev/null ), table contains 8798 records,
|
|
31 columns, order key have indices.
|
|
|
|
1.select count(*) from work_flats;
|
|
0.02user 0.00system 0:00.18elapsed 10%CPU (0avgtext+0avgdata 0maxresident)k
|
|
0inputs+0outputs (131major+21minor)pagefaults 0swaps
|
|
|
|
2.select * from work_flats order by rooms, metro_id;
|
|
2.35user 0.25system 0:10.11elapsed 25%CPU (0avgtext+0avgdata 0maxresident)k
|
|
0inputs+0outputs (131major+2799minor)pagefaults 0swaps
|
|
|
|
3.set query_limit to '150';
|
|
SET VARIABLE
|
|
select * from work_flats order by rooms, metro_id;
|
|
0.06user 0.00system 0:02.75elapsed 2%CPU (0avgtext+0avgdata 0maxresident)k
|
|
0inputs+0outputs (131major+67minor)pagefaults 0swaps
|
|
|
|
4.begin;
|
|
declare tt cursor for
|
|
select * from work_flats order by rooms, metro_id;
|
|
fetch 150 in tt;
|
|
end;
|
|
0.05user 0.01system 0:02.76elapsed 2%CPU (0avgtext+0avgdata 0maxresident)k
|
|
0inputs+0outputs (131major+67minor)pagefaults 0swaps
|
|
|
|
As you can see timings for query_limit and cursor are very similar,
|
|
I didn't expected this. So, in principle, enhanced version of fetch
|
|
(with offset) would cover all we need from LIMIT, but query_limit would be
|
|
still useful, for example to restrict loadness of server.
|
|
Will all enhancements you discussed go to the 6.4 ?
|
|
I'm really interested in testing this stuff because I begin new project
|
|
and everything we discussed here are badly needed.
|
|
|
|
|
|
Regards,
|
|
|
|
Oleg
|
|
|
|
|
|
|
|
On Thu, 15 Oct 1998, Jan Wieck wrote:
|
|
|
|
> Date: Thu, 15 Oct 1998 14:23:43 +0200 (MET DST)
|
|
> From: Jan Wieck <jwieck@debis.com>
|
|
> To: t-ishii@sra.co.jp
|
|
> Cc: jwieck@debis.com, oleg@sai.msu.su, hackers@postgreSQL.org
|
|
> Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
>
|
|
> Tatsuo Ishii wrote:
|
|
>
|
|
> > I think we have understanded your point. set query_limit is just a
|
|
> > easy alternative of using cursor and fetch.
|
|
> >
|
|
> > > I haven't looked at Tatsuo's patch very well. But if it
|
|
> > > limits the amount of data going into the sort (on ORDER BY),
|
|
> > > it will break it! The requested ordering could be different
|
|
> > > from what the choosen index might return. The used index is
|
|
> > > choosen by the planner upon the qualifications given, not the
|
|
> > > ordering wanted.
|
|
> >
|
|
> > I think it limits the final result. When query_limit is set,
|
|
> > the arg "numberTuples" of ExecutePlan() is set to it instead of 0
|
|
> > (this means no limit).
|
|
> >
|
|
> > Talking about "offset," it shouldn't be very difficult. I guess all we
|
|
> > have to do is adding a new arg "offset" to ExecutePlan() then making
|
|
> > obvious modifications. (and of course we have to modify set
|
|
> > query_limit syntax but it's trivial)
|
|
>
|
|
> The offset could become
|
|
>
|
|
> FETCH n IN cursor [OFFSET n];
|
|
>
|
|
> and
|
|
>
|
|
> SELECT ... [LIMIT offset,count];
|
|
>
|
|
> The FETCH command already calls ExecutorRun() with the given
|
|
> count (the tuple limit). Telling it the offset too is really
|
|
> simple. And ExecutorRun() could check if the toplevel
|
|
> executor node is an index scan. Skipping tuples during the
|
|
> index scan requires, that all qualifications are in the
|
|
> indexqual, thus any tuple returned by it will become a final
|
|
> result row (as it would be in the simple 1-table-queries we
|
|
> discussed). If that isn't the case, the executor must
|
|
> fallback to skip the final result tuples and that is after an
|
|
> eventually processed sort/merge of the complete result set.
|
|
> That would only reduce communication to the client and memory
|
|
> required there to buffer the result set (not a bad thing
|
|
> either).
|
|
>
|
|
> ProcessQueryDesc() in tcop/pquery.c also calls ExecutorRun()
|
|
> but with a constant 0 tuple count. Having offset and count in
|
|
> the parsetree would make it without any state variables or
|
|
> SET command. And it's the only clean way to restrict LIMIT to
|
|
> SELECT queries. Any thrown in LIMIT to ExecutorRun() from
|
|
> another place could badly hurt the rewrite system. Remember
|
|
> that non-instead actions on insert/update/delete are
|
|
> processed before the original query! And what about SQL
|
|
> functions that get processed during the evaluation of another
|
|
> query (view using an SQL function for count(*))?
|
|
>
|
|
> A little better would it be to make the LIMIT values able to
|
|
> be parameter nodes. C or PL functions use the prepared plan
|
|
> feature of the SPI manager for performance reasons.
|
|
> Especially the offset value might there need to be a
|
|
> parameter that the executor has to pick out first. If we
|
|
> change the count argument of ExecutorRun to a List *limit,
|
|
> this one could be NIL (to mean the old 0 count 0 offset
|
|
> behaviour) or a list of two elements that both can be either
|
|
> a Const or a Param of type int4. Easy for the executor to
|
|
> evaluate.
|
|
>
|
|
> The only places where ExecutorRun() is called are
|
|
> tcop/pquery.c (queries from frontend), commands/command.c
|
|
> (FETCH command), executor/functions.c (SQL functions) and
|
|
> executor/spi.c (SPI manager). So it is easy to change the
|
|
> call interface too.
|
|
>
|
|
>
|
|
> Jan
|
|
>
|
|
> --
|
|
>
|
|
> #======================================================================#
|
|
> # It's easier to get forgiveness for being wrong than for being right. #
|
|
> # Let's break this rule - forgive me. #
|
|
> #======================================== jwieck@debis.com (Jan Wieck) #
|
|
>
|
|
>
|
|
|
|
_____________________________________________________________
|
|
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
|
|
Sternberg Astronomical Institute, Moscow University (Russia)
|
|
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
|
|
phone: +007(095)939-16-83, +007(095)939-23-83
|
|
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Thu Oct 15 13:22:48 1998
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id NAA18540
|
|
for <maillist@candle.pha.pa.us>; Thu, 15 Oct 1998 13:22:46 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id MAA01819;
|
|
Thu, 15 Oct 1998 12:56:25 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Thu, 15 Oct 1998 12:51:43 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id MAA01305
|
|
for pgsql-hackers-outgoing; Thu, 15 Oct 1998 12:51:40 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from dsh.de (firewall-user@neptun.sns-felb.debis.de [53.122.101.2])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id MAA01283
|
|
for <pgsql-hackers@postgresql.org>; Thu, 15 Oct 1998 12:51:28 -0400 (EDT)
|
|
(envelope-from wieck@sapserv.debis.de)
|
|
Received: by dsh.de; id SAA21874; Thu, 15 Oct 1998 18:54:00 +0200 (MET DST)
|
|
Received: from dshmail.dsh.de(53.47.15.3) by neptun.dsh.de via smap (3.2)
|
|
id xma021705; Thu, 15 Oct 98 18:53:31 +0200
|
|
Received: from mail1.hh1.dsh.de (mail1.hh1.dsh.de [53.47.9.5])
|
|
by dshmail.dsh.de (8.8.7/8.8.7) with ESMTP id SAA25226;
|
|
Thu, 15 Oct 1998 18:50:57 +0200 (MET DST)
|
|
Received: from mars.SAPserv.Hamburg.dsh.de (root@mail5.hh1.dsh.de [53.2.168.17])
|
|
by mail1.hh1.dsh.de (8.8.7/8.8.7) with SMTP id SAA30639;
|
|
Thu, 15 Oct 1998 18:53:14 +0200
|
|
Received: from orion.SAPserv.Hamburg.dsh.de
|
|
by mars.SAPserv.Hamburg.dsh.de with smtp
|
|
for <<jwieck@debis.com>>
|
|
id m0zToEf-000B5AC; Thu, 15 Oct 98 16:18 MET DST
|
|
Received: by orion.SAPserv.Hamburg.dsh.de
|
|
for jwieck@debis.com
|
|
id m0zTqmM-000EBRC; Thu, 15 Oct 98 19:01 MET DST
|
|
Message-Id: <m0zTqmM-000EBRC@orion.SAPserv.Hamburg.dsh.de>
|
|
From: jwieck@debis.com (Jan Wieck)
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
To: hannu@trust.ee (Hannu Krosing)
|
|
Date: Thu, 15 Oct 1998 19:01:33 +0200 (MET DST)
|
|
Cc: jwieck@debis.com, pgsql-hackers@postgreSQL.org
|
|
Reply-To: jwieck@debis.com (Jan Wieck)
|
|
In-Reply-To: <36261DF7.D20368A0@trust.ee> from "Hannu Krosing" at Oct 15, 98 07:08:23 pm
|
|
X-Mailer: ELM [version 2.4 PL25]
|
|
Content-Type: text
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: RO
|
|
|
|
Hannu Krosing wrote:
|
|
|
|
> Jan Wieck wrote:
|
|
> > The speedup for the cursor/fetch scenario is so impressive
|
|
> > that I'll create a post 6.4 patch. I don't want it in 6.4
|
|
> > because there is absolutely no query in the whole regression
|
|
> > test, where it suppresses the sort node.
|
|
>
|
|
> Good, then it works as expected ;)
|
|
>
|
|
> More seriously, it is not within powers of current regression test
|
|
> framework to test speed improvements (only the case where
|
|
> performance-wise bad implementation will actually crash the backend,
|
|
> as in the cnfify problem, but AFAIK we dont test for those now)
|
|
>
|
|
> > So we have absolutely no check that it doesn't break anything.
|
|
>
|
|
> If it did pass the regression, then IMHO it did not break anything.
|
|
|
|
Thats the point. The check if the sort node is required
|
|
returns TRUE for ALL queries of the regression. So the
|
|
behaviour when it returns FALSE is absolutely not tested.
|
|
|
|
>
|
|
> I would vote for putting it in (maybe with a
|
|
> 'set fix_optimiser_stupidity on' safeguard to enable it). I see no
|
|
> reason to postpone it to 6.4.1 and force almost everybody to first
|
|
> patch their copy and then upgrade very soon.
|
|
>
|
|
> I would even go far enough to call it a bugfix, as it does not really
|
|
> introduce any new functionality only fixes some existing functionality
|
|
> so that much bigger databases can be actually used.
|
|
|
|
I can't call it a bugfix because it is only a performance win
|
|
in some situations. And I feel the risk is too high to put
|
|
untested code into the backend at BETA2 time. The max we
|
|
should do is to take this one and the LIMIT thing (maybe
|
|
implemented as I suggested lately), and put out a Web-
|
|
Performance-Release at the same time we release 6.4.
|
|
|
|
|
|
Jan
|
|
|
|
--
|
|
|
|
#======================================================================#
|
|
# It's easier to get forgiveness for being wrong than for being right. #
|
|
# Let's break this rule - forgive me. #
|
|
#======================================== jwieck@debis.com (Jan Wieck) #
|
|
|
|
|
|
|
|
|
|
From Inoue@tpf.co.jp Thu Oct 15 20:31:01 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id UAA26050
|
|
for <maillist@candle.pha.pa.us>; Thu, 15 Oct 1998 20:31:00 -0400 (EDT)
|
|
Received: from sd.tpf.co.jp (sd.tpf.co.jp [210.161.239.34]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id UAA12888 for <maillist@candle.pha.pa.us>; Thu, 15 Oct 1998 20:10:03 -0400 (EDT)
|
|
Received: from cadzone ([126.0.1.40])
|
|
by sd.tpf.co.jp (2.0 Build 2131 (Berkeley 8.8.4)/8.8.4) with SMTP
|
|
id JAA02574; Fri, 16 Oct 1998 09:00:34 +0900
|
|
From: "Hiroshi Inoue" <Inoue@tpf.co.jp>
|
|
To: "Jan Wieck" <jwieck@debis.com>,
|
|
"Bruce Momjian" <maillist@candle.pha.pa.us>
|
|
Subject: RE: [HACKERS] What about LIMIT in SELECT ?
|
|
Date: Fri, 16 Oct 1998 09:12:55 +0900
|
|
Message-ID: <000201bdf899$b953bf00$2801007e@cadzone.tpf.co.jp>
|
|
MIME-Version: 1.0
|
|
Content-Type: text/plain;
|
|
charset="iso-8859-1"
|
|
Content-Transfer-Encoding: 7bit
|
|
X-Priority: 3 (Normal)
|
|
X-MSMail-Priority: Normal
|
|
X-Mailer: Microsoft Outlook 8.5, Build 4.71.2173.0
|
|
In-Reply-To: <199810150552.BAA07576@candle.pha.pa.us>
|
|
Importance: Normal
|
|
X-MimeOLE: Produced By Microsoft MimeOLE V4.72.2106.4
|
|
Status: ROr
|
|
|
|
> -----Original Message-----
|
|
> From: owner-pgsql-hackers@postgreSQL.org
|
|
> [mailto:owner-pgsql-hackers@postgreSQL.org]On Behalf Of Bruce Momjian
|
|
> Sent: Thursday, October 15, 1998 2:52 PM
|
|
> To: jwieck@debis.com
|
|
> Cc: lockhart@alumni.caltech.edu; jwieck@debis.com; eric@linux-hw.com;
|
|
> jeff@remapcorp.com; hackers@postgreSQL.org
|
|
> Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
>
|
|
>
|
|
> > > I have had more time to think about this. Basically, for pre-sorted
|
|
> > > data, our psort code is very fast, because it does not need to sort
|
|
> > > anything. It just moves the rows in and out of the sort memory. Yes,
|
|
> > > it could be removed in some cases, and probably should be,
|
|
> but it is not
|
|
> > > going to produce great speedups.
|
|
> >
|
|
> > And I got the time to hack around about this.
|
|
> >
|
|
> > I hacked in a little check into the planner, that compares
|
|
> > the sortClause against the key field list of an index scan
|
|
> > and just suppresses the sort node if it exactly matchs and
|
|
> > all sort operators are "<".
|
|
> >
|
|
> > I tested with a 10k row table where key is a text field. The
|
|
> > base query is a
|
|
> >
|
|
> > SELECT ... WHERE key > 'val' ORDER BY key;
|
|
> >
|
|
> > The used 'val' is always a key that is close to the first of
|
|
> > all keys in the table ('' on the first query and the last
|
|
> > selected value on subsequent ones).
|
|
>
|
|
> This is good stuff. I want to think about it for a day. Sounds very
|
|
> promising.
|
|
>
|
|
|
|
Did you see my contribution about this subject ?
|
|
I have already implemented above cases and used on trial for three
|
|
months or more.
|
|
It is good to be formally supported by PostgreSQL community.
|
|
|
|
And please remember that there are descending order cases.
|
|
(Moreover there are compound cases such as
|
|
SELECT * from ... order by key1 desc,key2 asc;
|
|
I didn't implement such cases.)
|
|
|
|
Thanks.
|
|
|
|
Hiroshi Inoue
|
|
Inoue@tpf.co.jp
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Fri Oct 16 04:01:07 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id EAA02029
|
|
for <maillist@candle.pha.pa.us>; Fri, 16 Oct 1998 04:01:04 -0400 (EDT)
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id DAA05509 for <maillist@candle.pha.pa.us>; Fri, 16 Oct 1998 03:43:53 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id CAA11278;
|
|
Fri, 16 Oct 1998 02:00:01 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Fri, 16 Oct 1998 01:57:25 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id BAA11129
|
|
for pgsql-hackers-outgoing; Fri, 16 Oct 1998 01:57:21 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from candle.pha.pa.us (root@s5-03.ppp.op.net [209.152.195.67])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id BAA11116
|
|
for <hackers@postgreSQL.org>; Fri, 16 Oct 1998 01:57:00 -0400 (EDT)
|
|
(envelope-from maillist@candle.pha.pa.us)
|
|
Received: (from maillist@localhost)
|
|
by candle.pha.pa.us (8.9.0/8.9.0) id BAA29942;
|
|
Fri, 16 Oct 1998 01:34:33 -0400 (EDT)
|
|
From: Bruce Momjian <maillist@candle.pha.pa.us>
|
|
Message-Id: <199810160534.BAA29942@candle.pha.pa.us>
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
In-Reply-To: <m0zTY6V-000EBRC@orion.SAPserv.Hamburg.dsh.de> from Jan Wieck at "Oct 14, 1998 11: 5: 7 pm"
|
|
To: jwieck@debis.com
|
|
Date: Fri, 16 Oct 1998 01:34:33 -0400 (EDT)
|
|
Cc: lockhart@alumni.caltech.edu, jwieck@debis.com, eric@linux-hw.com,
|
|
jeff@remapcorp.com, hackers@postgreSQL.org
|
|
X-Mailer: ELM [version 2.4ME+ PL47 (25)]
|
|
MIME-Version: 1.0
|
|
Content-Type: text/plain; charset=US-ASCII
|
|
Content-Transfer-Encoding: 7bit
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: RO
|
|
|
|
OK, I have had my day of thinking, and will address this specific
|
|
posting first, because it is the most fundamental concerning the future
|
|
direction of the optimization.
|
|
|
|
>
|
|
> And I got the time to hack around about this.
|
|
>
|
|
> I hacked in a little check into the planner, that compares
|
|
> the sortClause against the key field list of an index scan
|
|
> and just suppresses the sort node if it exactly matchs and
|
|
> all sort operators are "<".
|
|
>
|
|
> I tested with a 10k row table where key is a text field. The
|
|
> base query is a
|
|
>
|
|
> SELECT ... WHERE key > 'val' ORDER BY key;
|
|
>
|
|
> The used 'val' is always a key that is close to the first of
|
|
> all keys in the table ('' on the first query and the last
|
|
> selected value on subsequent ones).
|
|
>
|
|
> Scenario 1 (S1) uses exactly the above query but processes
|
|
> only the first 20 rows from the result buffer. Thus the
|
|
> frontend receives nearly the whole table.
|
|
|
|
OK.
|
|
|
|
>
|
|
> Scenario 2 (S2) uses a cursor and FETCH 20. But closes the
|
|
> cursor and creates a new one for the next selection (only
|
|
> with another 'val') as it would occur in a web application.
|
|
>
|
|
> If there is no index on key, the backend will allways do a
|
|
> Sort->SeqScan and due to the 'val' close to the lowest
|
|
> existing key nearly all tuples get scanned and put into the
|
|
> sort. S1 here runs about 10 seconds and S2 about 6 seconds.
|
|
> The speedup in S2 results from the reduced overhead of
|
|
> sending not wanted tuples into the frontend.
|
|
|
|
Makes sense. All rows are processed, but not sent to client.
|
|
|
|
>
|
|
> Now with a btree index on key and an unpatched backend.
|
|
> Produced plan is always a Sort->IndexScan. S1 needs 16
|
|
> seconds and S2 needs 12 seconds. Again nearly all data is put
|
|
> into the sort but this time over the index scan and that is
|
|
> slower.
|
|
|
|
VACUUM ANALYZE could affect this. Because it had no stats, it thought
|
|
index use would be faster, but in fact because 'val' was near the lowest
|
|
value, it as selecting 90% of the table, and would have been better with
|
|
a sequential scan. pg_statistics's low/hi values for a column could
|
|
have told that to the optimizer.
|
|
|
|
I know the good part of the posting is coming.
|
|
|
|
> Last with the btree index on key and the patched backend.
|
|
> This time the plan is a plain IndexScan because the ORDER BY
|
|
> clause exactly matches the sort order of the chosen index.
|
|
> S1 needs 13 seconds and S2 less than 0.2! This dramatic
|
|
> speedup comes from the fact, that this time the index scan is
|
|
> the toplevel executor node and the executor run is stopped
|
|
> after 20 tuples have been selected.
|
|
|
|
OK, seems like in the S1 case, the use of the psort/ORDER BY code on top
|
|
of the index was taking and extra 3 seconds, which is 23%. That is a
|
|
lot more than I thought for the psort code, and shows we could gain a
|
|
lot by removing unneeded sorts from queries that are already using
|
|
matching indexes.
|
|
|
|
Just for clarity, added to TODO. I think everyone is clear on this one,
|
|
and its magnitude is a surprise to me:
|
|
|
|
* Prevent psort() usage when query already using index matching ORDER BY
|
|
|
|
|
|
> Analysis of the above timings:
|
|
>
|
|
> If there is an ORDER BY clause, using an index scan is the
|
|
> clever way if the indexqual dramatically reduces the the
|
|
> amount of data selected and sorted. I think this is the
|
|
> normal case (who really selects nearly all rows from a 5M row
|
|
> table?). So choosing the index path is correct. This will
|
|
> hurt if someone really selects most of the rows and the index
|
|
> scan jumps over the disc. But here the programmer should use
|
|
> an unqualified query to perform a seqscan and do the
|
|
> qualification in the frontend application.
|
|
|
|
Fortunately, the optimizer already does the index selection for us, and
|
|
guesses pretty well if the index or sequential scan is better. Once we
|
|
implement the above removal of psort(), we will have to change the
|
|
timings because now you have to compare index scan against sequential
|
|
scan AND psort(), because in the index scan situation, you don't need
|
|
the psort(), assuming the ORDER BY matches the index exactly.
|
|
|
|
> The speedup for the cursor/fetch scenario is so impressive
|
|
> that I'll create a post 6.4 patch. I don't want it in 6.4
|
|
> because there is absolutely no query in the whole regression
|
|
> test, where it suppresses the sort node. So we have
|
|
> absolutely no check that it doesn't break anything.
|
|
>
|
|
> For a web application, that can use a unique key to select
|
|
> the next amount of rows, it will be a big win.
|
|
|
|
OK, I think the reason the regression test did not show your code being
|
|
used is important.
|
|
|
|
First, most of the tables are small in the regression test, so sequential
|
|
scans are faster. Second, most queries using indexes are either joins,
|
|
which do the entire table, or equality tests, like col = 3, where there
|
|
is no matching ORDER BY because all the col values are 3. Again, your
|
|
code can't help with these.
|
|
|
|
The only regression-type code that would use it would be a 'col > 3'
|
|
qualification with a col ORDER BY, and there aren't many of those.
|
|
|
|
However, if we think of the actual application you are addressing, it is
|
|
a major win. If we are going after only one row of the index, it is
|
|
fast. If we are going after the entire table, it is faster to
|
|
sequential scan and psort(). You big win is with the partial queries,
|
|
where you end up doing a full sequential scan or index scan, then and
|
|
ORDER BY, while you really only need a few rows from the query, and if
|
|
you deal directly with the index, you can prevent many rows from being
|
|
processed. It is the ability to skip processing those extra rows that
|
|
makes it a big win, not so much the removal of the ORDER BY, though that
|
|
helps too.
|
|
|
|
Your solution really is tailored for this 'partial' query application,
|
|
and I think it is a big need for certain applications that can't use
|
|
cursors, like web apps. Most other apps have long-time connections to
|
|
the database, and are better off with cursors.
|
|
|
|
I did profiling to improve startup time, because the database
|
|
requirements of web apps are different from normal db apps, and we have
|
|
to adjust to that.
|
|
|
|
So, to reiterate, full queries are not benefited as much from the new
|
|
code, because sequential scan/psort is faster, or because the index only
|
|
retrieves a small number of rows because the qualification of values is
|
|
very specific.
|
|
|
|
Those open-ended, give me the rows from 100 to 199 really need your
|
|
modifications.
|
|
|
|
OK, we have QUERY_LIMIT, and that allows us to throw any query at the
|
|
system, and it will return that many of the first rows for the ORDER BY.
|
|
No fancy stuff required. If we can get a matching index, we may be able
|
|
to remove the requirement of scanning all the row (with Jan's patch),
|
|
and that is a big win. If not, we at least prevent the rows from being
|
|
returned to the client.
|
|
|
|
However, there is the OFFSET issue. This is really a case where the
|
|
user wants to _restart_ the query where they left off. That is a
|
|
different problem. All of a sudden, we need to evaluate more of the
|
|
query, and return a segment from the middle of the result set.
|
|
|
|
I think we need to decide how to handle such a restart. Do we
|
|
re-evaluate the entire query, skipping all the rows up to OFFSET, and
|
|
return the number of rows they requested after OFFSET. I would think we
|
|
don't want to do that, do we. It would be much easier to code. If it
|
|
is a single table, skipping forward has to be done anyway, because we
|
|
can't just _jump_ to the 100th entry in the index, unless we pass some
|
|
_tid_ to the user, and expect them to pass that back to start the query.
|
|
I don't think we went to do that. It is ugly, and the row may have
|
|
moved since we started. So, for a single table, adding a QUERY_OFFSET
|
|
would do exactly what we need, with Jan's patches.
|
|
|
|
For a joined query, I think you will have to do the entire _join_ before
|
|
returning anything.
|
|
|
|
You can't just process all the joins up to the OFFSET location, and you
|
|
can't just jump to the 100th index location, because you don't know that
|
|
the 100th index location produced the 100th result just returned to the
|
|
user. You have to process the whole query, and because of the join and
|
|
not knowing which data row from each table is going to make which entry
|
|
in the final result. If you are really craft, and the ORDER BY table is
|
|
in the outer part of the join loop, you could start processing the table
|
|
that is part of the outer loop in _index_ order, because you know that
|
|
the rows processed in index order are going to produce the output in
|
|
result order. You then could process and throw away the results up to
|
|
offset, and generate the needed rows and stop.
|
|
|
|
The other way of doing it is to specify a query limit based on specific
|
|
index entries, so you say I want the query returned by the first 20
|
|
index entries matching the ORDER BY, or entries 100-199, and the query
|
|
is limited to using only those entries in the index. In that case,
|
|
though, in joins, you could return more or less rows in the result
|
|
depending on the other tables, and that may be unacceptable. However,
|
|
for this case, the advantage is that you don't need to process the rows
|
|
from 1 to 99 because you have been told the user only wants rows from
|
|
certain index slots. If the user requests rows 50000-50100, this would
|
|
be much faster because you don't have to process the 50000 rows before
|
|
returning any data. However, I question how often people grab stuff
|
|
from the center of large data sets. Seems the QUERY_OFFSET idea may be
|
|
easier for users.
|
|
|
|
I will be commenting on the rest of the optimization postings tomorrow.
|
|
|
|
--
|
|
Bruce Momjian | http://www.op.net/~candle
|
|
maillist@candle.pha.pa.us | (610) 853-3000
|
|
+ If your life is a hard drive, | 830 Blythe Avenue
|
|
+ Christ can be your backup. | Drexel Hill, Pennsylvania 19026
|
|
|
|
|
|
From Inoue@tpf.co.jp Fri Oct 16 03:31:02 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id DAA01767
|
|
for <maillist@candle.pha.pa.us>; Fri, 16 Oct 1998 03:31:00 -0400 (EDT)
|
|
Received: from sd.tpf.co.jp (sd.tpf.co.jp [210.161.239.34]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id DAA04551 for <maillist@candle.pha.pa.us>; Fri, 16 Oct 1998 03:13:40 -0400 (EDT)
|
|
Received: from cadzone ([126.0.1.40])
|
|
by sd.tpf.co.jp (2.0 Build 2131 (Berkeley 8.8.4)/8.8.4) with SMTP
|
|
id QAA02680; Fri, 16 Oct 1998 16:04:09 +0900
|
|
From: "Hiroshi Inoue" <Inoue@tpf.co.jp>
|
|
To: "Bruce Momjian" <maillist@candle.pha.pa.us>
|
|
Cc: <jwieck@debis.com>
|
|
Subject: RE: [HACKERS] What about LIMIT in SELECT ?
|
|
Date: Fri, 16 Oct 1998 16:16:29 +0900
|
|
Message-ID: <000001bdf8d4$e4cdf520$2801007e@cadzone.tpf.co.jp>
|
|
MIME-Version: 1.0
|
|
Content-Type: text/plain;
|
|
charset="iso-8859-1"
|
|
Content-Transfer-Encoding: 7bit
|
|
X-Priority: 3 (Normal)
|
|
X-MSMail-Priority: Normal
|
|
X-Mailer: Microsoft Outlook 8.5, Build 4.71.2173.0
|
|
Importance: Normal
|
|
X-MimeOLE: Produced By Microsoft MimeOLE V4.72.2106.4
|
|
In-Reply-To: <199810160621.CAA01030@candle.pha.pa.us>
|
|
Status: RO
|
|
|
|
Where's my contibution to hackers@potsgreSQL.org ?
|
|
I will resend it.
|
|
|
|
> -----Original Message-----
|
|
> From: Bruce Momjian [mailto:maillist@candle.pha.pa.us]
|
|
> Sent: Friday, October 16, 1998 3:22 PM
|
|
> To: Hiroshi Inoue
|
|
> Cc: jwieck@debis.com
|
|
> Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
>
|
|
>
|
|
> [Charset iso-8859-1 unsupported, filtering to ASCII...]
|
|
> > > > The used 'val' is always a key that is close to the first of
|
|
> > > > all keys in the table ('' on the first query and the last
|
|
> > > > selected value on subsequent ones).
|
|
> > >
|
|
> > > This is good stuff. I want to think about it for a day. Sounds very
|
|
> > > promising.
|
|
> > >
|
|
> >
|
|
> > Did you see my contribution about this subject ?
|
|
>
|
|
> I am sorry. I have not seen it, and I am confused how I could have
|
|
> missed it.
|
|
>
|
|
> > I have already implemented above cases and used on trial for three
|
|
> > months or more.
|
|
> > It is good to be formally supported by PostgreSQL community.
|
|
> >
|
|
> > And please remember that there are descending order cases.
|
|
> > (Moreover there are compound cases such as
|
|
> > SELECT * from ... order by key1 desc,key2 asc;
|
|
> > I didn't implement such cases.)
|
|
>
|
|
> Where is the discussion of this? I am confused. You have been using
|
|
> code for three months that does this?
|
|
>
|
|
|
|
Hi all.
|
|
I didn't follow all the posts about this thread.
|
|
So this post may be out of center.
|
|
|
|
I think current PostgreSQL lacks the concern to the response to get first
|
|
rows quickly.
|
|
For example,queries with ORDER BY clause necessarily include sort steps
|
|
and process all target rows to get first rows only.
|
|
So I modified my code for ORDER BY cases and use on trial.
|
|
I don't understand PostgreSQL sources,so my code is not complete.
|
|
|
|
I modified my code for the following 2 cases.
|
|
|
|
1.In many cases the following query uses index scan.
|
|
SELECT * from ... where key > ...; (where (key) is an index)
|
|
If so,we can omit sort steps from the access plan for the following
|
|
query.
|
|
SELECT * from ... where key > ... order by key;
|
|
|
|
Currently cursors without sort steps may be sensitive diffrent from
|
|
cursors with sort steps. But no one mind it.
|
|
|
|
2.In many cases the following query uses index scan same as case 1.
|
|
SELECT * from ... where key < ...;(where (key) is an index)
|
|
If so and if we scan the index backward,we can omit sort steps from
|
|
the access plan for the following query.
|
|
SELECT * from ... where key < ... order by key desc;
|
|
|
|
To achive this(backward scan),I used hidden(provided for the future ?)code
|
|
that is never executed and is not necessarily correct.
|
|
|
|
In the following cases I didn't modify my code to use index scan,
|
|
because I couldn't formulate how to tell PostgreSQL optimizer whether
|
|
the response to get first rows is needed or the throughput to process
|
|
sufficiently many target rows is needed.
|
|
|
|
3.The access plan made by current PostgreSQL optimizer for a query with
|
|
ORDER BY clause doesn't include index scan.
|
|
|
|
I thought the use of Tatsuo's QUERY_LIMIT to decide that the responce
|
|
is needed. It is sufficient but not necessary ?
|
|
In Oracle the hints FIRST_ROWS,ALL_ROWS are used.
|
|
|
|
Thanks.
|
|
|
|
Hiroshi Inoue
|
|
Inoue@tpf.co.jp
|
|
|
|
|
|
From wieck@sapserv.debis.de Fri Oct 16 05:01:03 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id FAA02500
|
|
for <maillist@candle.pha.pa.us>; Fri, 16 Oct 1998 05:01:02 -0400 (EDT)
|
|
Received: from dsh.de (firewall-user@neptun.sns-felb.debis.de [53.122.101.2]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id EAA06270 for <maillist@candle.pha.pa.us>; Fri, 16 Oct 1998 04:13:59 -0400 (EDT)
|
|
Received: by dsh.de; id KAA11635; Fri, 16 Oct 1998 10:12:45 +0200 (MET DST)
|
|
Received: from dshmail.dsh.de(53.47.15.3) by neptun.dsh.de via smap (3.2)
|
|
id xma011343; Fri, 16 Oct 98 10:12:15 +0200
|
|
Received: from mail1.hh1.dsh.de (mail1.hh1.dsh.de [53.47.9.5])
|
|
by dshmail.dsh.de (8.8.7/8.8.7) with ESMTP id KAA21793;
|
|
Fri, 16 Oct 1998 10:09:49 +0200 (MET DST)
|
|
Received: from mars.SAPserv.Hamburg.dsh.de (root@mail5.hh1.dsh.de [53.2.168.17])
|
|
by mail1.hh1.dsh.de (8.8.7/8.8.7) with SMTP id KAA01799;
|
|
Fri, 16 Oct 1998 10:12:11 +0200
|
|
Received: from orion.SAPserv.Hamburg.dsh.de
|
|
by mars.SAPserv.Hamburg.dsh.de with smtp
|
|
for <<maillist@candle.pha.pa.us>>
|
|
id m0zU2aB-000B5AC; Fri, 16 Oct 98 07:37 MET DST
|
|
Received: by orion.SAPserv.Hamburg.dsh.de
|
|
for maillist@candle.pha.pa.us
|
|
id m0zU57w-000EBQC; Fri, 16 Oct 98 10:20 MET DST
|
|
Message-Id: <m0zU57w-000EBQC@orion.SAPserv.Hamburg.dsh.de>
|
|
From: jwieck@debis.com (Jan Wieck)
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
To: Inoue@tpf.co.jp (Hiroshi Inoue)
|
|
Date: Fri, 16 Oct 1998 10:20:47 +0200 (MET DST)
|
|
Cc: maillist@candle.pha.pa.us, jwieck@debis.com
|
|
Reply-To: jwieck@debis.com (Jan Wieck)
|
|
In-Reply-To: <000001bdf8d4$e4cdf520$2801007e@cadzone.tpf.co.jp> from "Hiroshi Inoue" at Oct 16, 98 04:16:29 pm
|
|
X-Mailer: ELM [version 2.4 PL25]
|
|
Content-Type: text
|
|
Status: RO
|
|
|
|
Hiroshi Inoue wrote:
|
|
|
|
> In the following cases I didn't modify my code to use index scan,
|
|
> because I couldn't formulate how to tell PostgreSQL optimizer whether
|
|
> the response to get first rows is needed or the throughput to process
|
|
> sufficiently many target rows is needed.
|
|
>
|
|
> 3.The access plan made by current PostgreSQL optimizer for a query with
|
|
> ORDER BY clause doesn't include index scan.
|
|
>
|
|
> I thought the use of Tatsuo's QUERY_LIMIT to decide that the responce
|
|
> is needed. It is sufficient but not necessary ?
|
|
> In Oracle the hints FIRST_ROWS,ALL_ROWS are used.
|
|
|
|
I still think that the QUERY LIMIT should be part of the
|
|
parse tree and not thrown in by a magic SET command. If
|
|
rewriting or function calls turn the one query sent to the
|
|
backend into multiple queries processed internal, how should
|
|
this QUERY LIMIT variable know to which of all the queries it
|
|
has to be applied? It can really break functions and rewrite
|
|
rules if this variable is used on all queries while it is
|
|
set.
|
|
|
|
For your case 3 I think, if there is a QUERY LIMIT in the
|
|
parse tree, the (future) optimizer definitely knows that not
|
|
all rows will get processed even if there is no qualification
|
|
given. So if there is an index, that matches the ORDER BY
|
|
clause and it is no a join and the (future) executor handles
|
|
OFFSET in single table index scans fast, it could choose an
|
|
index scan for this query too.
|
|
|
|
|
|
Jan
|
|
|
|
--
|
|
|
|
#======================================================================#
|
|
# It's easier to get forgiveness for being wrong than for being right. #
|
|
# Let's break this rule - forgive me. #
|
|
#======================================== jwieck@debis.com (Jan Wieck) #
|
|
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Fri Oct 16 12:02:27 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id MAA13063
|
|
for <maillist@candle.pha.pa.us>; Fri, 16 Oct 1998 12:02:23 -0400 (EDT)
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id MAA18435 for <maillist@candle.pha.pa.us>; Fri, 16 Oct 1998 12:01:46 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id LAA24469;
|
|
Fri, 16 Oct 1998 11:28:54 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Fri, 16 Oct 1998 11:25:54 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id LAA24370
|
|
for pgsql-hackers-outgoing; Fri, 16 Oct 1998 11:25:52 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from dsh.de (firewall-user@neptun.sns-felb.debis.de [53.122.101.2])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id LAA24356
|
|
for <pgsql-hackers@postgreSQL.org>; Fri, 16 Oct 1998 11:25:34 -0400 (EDT)
|
|
(envelope-from wieck@sapserv.debis.de)
|
|
Received: by dsh.de; id RAA06506; Fri, 16 Oct 1998 17:28:04 +0200 (MET DST)
|
|
Received: from dshmail.dsh.de(53.47.15.3) by neptun.dsh.de via smap (3.2)
|
|
id xma006149; Fri, 16 Oct 98 17:27:12 +0200
|
|
Received: from mail1.hh1.dsh.de (mail1.hh1.dsh.de [53.47.9.5])
|
|
by dshmail.dsh.de (8.8.7/8.8.7) with ESMTP id RAA00811
|
|
for <pgsql-hackers@postgreSQL.org>; Fri, 16 Oct 1998 17:24:37 +0200 (MET DST)
|
|
Received: from mars.SAPserv.Hamburg.dsh.de (root@mail5.hh1.dsh.de [53.2.168.17])
|
|
by mail1.hh1.dsh.de (8.8.7/8.8.7) with SMTP id RAA04532
|
|
for <pgsql-hackers@postgreSQL.org>; Fri, 16 Oct 1998 17:26:54 +0200
|
|
Received: from orion.SAPserv.Hamburg.dsh.de
|
|
by mars.SAPserv.Hamburg.dsh.de with smtp
|
|
for <<pgsql-hackers@postgreSQL.org>>
|
|
id m0zU9N0-000B5AC; Fri, 16 Oct 98 14:52 MET DST
|
|
Received: by orion.SAPserv.Hamburg.dsh.de
|
|
for pgsql-hackers@postgreSQL.org
|
|
id m0zUBum-000EBQC; Fri, 16 Oct 98 17:35 MET DST
|
|
Message-Id: <m0zUBum-000EBQC@orion.SAPserv.Hamburg.dsh.de>
|
|
From: jwieck@debis.com (Jan Wieck)
|
|
Subject: [HACKERS] SELECT ... LIMIT (trial implementation)
|
|
To: pgsql-hackers@postgreSQL.org (PostgreSQL HACKERS)
|
|
Date: Fri, 16 Oct 1998 17:35:39 +0200 (MET DST)
|
|
Reply-To: jwieck@debis.com (Jan Wieck)
|
|
X-Mailer: ELM [version 2.4 PL25]
|
|
Content-Type: text
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: ROr
|
|
|
|
Here we go,
|
|
|
|
this is up to now only for discussion, do not apply to CVS!
|
|
|
|
Those involved into the LIMIT discussion please comment.
|
|
|
|
Here is what I had in mind for the SELECT ... LIMIT. It adds
|
|
|
|
SELECT ... [LIMIT count [, offset]]
|
|
|
|
to the parser and arranges that these values are passed down
|
|
to the executor.
|
|
|
|
It is a clean implementation of LIMIT (regression tested) and
|
|
the open items on it are to enable parameters and handle it
|
|
in SQL functions and SPI stuff (currently ignored in both).
|
|
Optimizing the executor would require the other sort node
|
|
stuff discussion first to come to a conclusion. For now it
|
|
skips final result rows - but that's already one step forward
|
|
since it reduces the rows sent to the frontend to exactly
|
|
that what LIMIT requested.
|
|
|
|
I've seen the queryLimit by SET variable stuff and that
|
|
really can break rewrite rules, triggers or functions. This
|
|
is because the query limit will be inherited by any query
|
|
(inserts, updates, deletes too) done by them. Have a rule for
|
|
constraint deletes of referencing tuples
|
|
|
|
CREATE RULE del_table1 AS ON DELETE TO table1 DO
|
|
DELETE FROM table2 WHERE ref = OLD.key;
|
|
|
|
If the user now sets the query limit to 1 via SET and deletes
|
|
a row from table1, only the first found record in table2 will
|
|
be constraint deleted, not all of them.
|
|
|
|
This is a feature where users can get around rules that
|
|
ensure data integrity.
|
|
|
|
|
|
Jan
|
|
|
|
--
|
|
|
|
#======================================================================#
|
|
# It's easier to get forgiveness for being wrong than for being right. #
|
|
# Let's break this rule - forgive me. #
|
|
#======================================== jwieck@debis.com (Jan Wieck) #
|
|
|
|
|
|
begin 644 opt_limit.diff.gz
|
|
M'XL(`$]=)S8"`^4\:W?B1K*?R:_H82=98&1;$F^<>`^+F1DV&!S`N<DGCBP:
|
|
M6SM"(I*PQSOQ?[]5_9!:(`$S<1Y[+V<&I.[JZJ[NZGIUM1?.<DE.[("$@7WJ
|
|
M!\[=V:UE?Z#>XLSV5RO+6X3RX=1&F#W57U4JE6/0%-X&#AG;$3$:Q#`Z]6JG
|
|
MVB)&N]WZZN3DY$`?J;9FI][HU&J\;27]88.IMC6H9J^(&E_K!)Y.OB+D;XYG
|
|
MNYL%)<5-Y+CAV<K^&)W>%S-JUGX066YV7?@4VI9]3['VC5(;1H'CW?$V\.^L
|
|
MPKI5/U!**H7"-4/>_TCM3>0'[ZFUOJ(K/W@B8;19+C.I,G1#,_1&0I>AFU#0
|
|
MDI05'"\J%`I+:D6;@)YCR0\;&CQ=TM"&/G^1SZR&]];SO8A^C(C-?\^!E@*4
|
|
MA8#'=5:VO_&B<TY)(9N4`M`26IX3/1&8#?M#F#URHZH91DL9N5'7C&HM'OGS
|
|
MGD[>L$YZ`9!%B85##2/B^3#;R\!?D>B>DCOG@7J$#9<\6.Z&\D8YN,[P9T57
|
|
M(8U*WT@R-:)K)'3^0_UE29:5RVQ"Y.MI]+2FA<)W9#9G<Y2N9.-B$-^1P6A6
|
|
M&P\N,P!<ZD&]Z`?6JU;.`&(D`%CI$E9R51;+L`/FA-[&=0'N;7<X[6<`W#X!
|
|
M)ASOY*:?V1YF8$]S!U@\3`$@3#X?W-&(K0;?-\GJB'?/6E$.F8/@+)MWF@W-
|
|
M:)F<=P`N=^/`?*68NHS0`OP=C1+($A]0F7'V*U*0N"8;KQ1O$HW$+:<1<)YL
|
|
MI!&QOS3.;V6Y/YPE*2THL.9WWY&1[]$R;$68JZOQCWTD#<>R7@:4)EU`4[87
|
|
MVK"+V\DN_NL06!KA+JN41S?#8?(6;YG?1OHB5P%1,5SV<&4YWI8&RJC/5D$9
|
|
M@,?KH$.-&QU#[]1;^4JH4=,:[43DP6M3EXL<PHP[-GGPG07I>XMKU_)*^$4J
|
|
M:_C62)\M":E0!*1LHD63V6;MTIEUZ]*IZT>DPM>6,@SI5K#`"4:V"%P=0(D5
|
|
MA'0&"Z*1WFHQ`YE%_#4-`+\/D&\0$D03\9=+$!"LPU`@P&)OL[JE@2@F4]OR
|
|
M+IV`VMB8+.23@&<$EBIK4(K1<N/995(JJ]3P"0`2)A04)WV@I6WZ0OC62!8>
|
|
M+5M<-$#5R'E'5;LC;#[WP[`@%V\/#;:6NK,275M1-MGVFN`,QAL,7_A6`O2?
|
|
M<,K$>A0*\8(D>KR``XG73@@/8"LCX:N_-KE<@(#XX)RE%'SY)&2K##"4FJJA
|
|
MU`01VVS'(G:;M("&&Y=9.F#],(OS$L49RC16B.Q7*)0J6.!X;$3(@4QG,J.+
|
|
M$P026D_*N#G"BJ1QL_,YWH)J`TEME:1V33/K"4E\PD\N:#A?![Y-0;<O9.=J
|
|
MI0LJ'3<3&"D>6`?.8NPLV`#?9`^06U\@X;F-)76\(-A?LK?AX&HP([9K;4)A
|
|
M>N4@8J87JHN8"<@KT!F@7\I8\XD)'V&`5FP.P684QL9^KJW`6D'=6JUC:-E3
|
|
M^.A$]CTIH7DXL^Z2;LH,O^B@`'8-E59<AQ<5['@-2ZP<-%W<^ES`)""PPN62
|
|
M:')RD5AM90EZ"_;J!VDSQ3VRX<L>J>O?E?J3R7BBD>(::VA$`V*%!#IVI/P%
|
|
M4S<BS@H8=D5![2_($XV*>=TLZ-("3L[L8>/1CVN0SH""6<]HJY*O%]O]%362
|
|
M,7N\OV?9$2ZA&-ZW1.=SF^XM10+H"*3BEA*/WL'V>:""@N?8J#R:\_BV^BV,
|
|
MQS'LX;O$X-YA.Z5J+]<)_^$@TTDAH?*<TD<B103'L=??B^%X7W\8O['NTNRF
|
|
MNEUI;N-CV\ML'&0_KX$PE*LEU!*SHE'MY,C=MF;J]=C[$!^B&%6B(+:IU%+%
|
|
MK'J5E':'P_GLYGK8GZ:`W_K!HQ4L4A95"D!5/^>\0JP':@2S86IF(]$(+S)4
|
|
MH:G5(NXNO]#`L^;<U%N::50376>"^C:;K1>@[,TN9>0%*<NDQH3!FTV%FE93
|
|
M,]O5/X&:?X)K\YO)J<+H:Z827-.K6E5OORPY>@IF/.KS_?+2:],`JZJA6E7-
|
|
M>DMK-FH*,8=]*%[\6QTI7OP%WE0F8<V:J35K"M.U#%UKF<HJ?>(HN>5+N"Y,
|
|
M;\V4",Y1T*BA81U`A#^21]3-J#<BZCX1[CY3'@#".02CFH0?G+5L)GJ.&/D$
|
|
M1'Q`P8\(45W</@G%SB?N-.X)5M,%W-!3>.]OW`6)8"DBGRQ\`O+?@1FC%.:?
|
|
MN.!)NJ%L!KC7?A@Z8.Z3TF*#`5J`7="/)(3I+DNP$_(ORQ,O><2>;5D_?/'(
|
|
MA51+0M6?G*C50G=BC-7Q-CPP&T<\,V?VB-@(KC^R1I@7'%$`#D1'%,@O"(_D
|
|
MM:YV=*.CF_GQ$5/70;4VI&K%R1`J&<V=87<*ULMX.OOAIC^<]\975]W198F&
|
|
MY3+Y!^G_U._-)_T9"`;2$6\W(QGN"B@@\5+.*@U/+GY9:`1_I6L:NZ4ZTTEB
|
|
M093`!)?68I1BX_QYH\P.QC$3-F?X1W!1N';R^(=5'>`<!I/!,^T#/+/;#KBE
|
|
MU3';^=S2!%'=3(EJ5M",)=JM[[NP-4,921IXD3^A+I/%L%1+RPWYUK/OK8!%
|
|
M$4"*^B.P@57IQX]0`K%I<XY"I`49RWIA0V:-NV76M5955R2QV=!:R7$4ZXUQ
|
|
MPO1Z,&<&[7Q\?3/Z?C3^'\8K7%*\(?*3?402?RK\YXCCDBA]7J)@V(O_3'D[
|
|
M_A!%?E)G*0E6DCI4V0&.SU8DL'+`D@WL`H$2>/NT);L%FP?10AR\1+$GEMV$
|
|
M'\#P)O$9238H.XJ1E(KCF#RDZ#$?A90?T&R!RH`\DQ\L(H\B@D?DF2)>!Q1]
|
|
M/[8O1'1><"!C>SSK7()?Q_D1A-;-#%AR.NO.IMD,7@,&KQN)AZ2P\_C[>>]F
|
|
M,AU/$D;./XX0`H_)R;?HUD7I4X8Y8K0W04#1`U:#7!FAKW,Y#?$>Q:GH75W.
|
|
MI_UAOS<CWWQ#XKY/+N3I!711YKNT`?92HY'LTA>CZMA#E3^,W'PU@5(#S^#7
|
|
M3ZCDMRV-G=IL9;$#MJLO:GJVOMC?%/2,V:G6\U6&46^V-*/>4CQ)6:0LK$<?
|
|
ML9^3BXT'T]9CH228Y(BNUG/7$0%@(8,+N%H@I]9/)12F&A&--1[1&/-@V?EA
|
|
MR)Y8:KY=!6\)D%B;'UH8?Q/EKTM2N6]9$JC/796<EK`H>J>N[['ZZFW-;"C)
|
|
M&;P@<>ZM]1IZF;(\BH&W]$&0!1J/K9U<W%OA=',[=+P/(1A8Q2C8T"+85D6F
|
|
MW8O\:"D'`\"I*USDL'.@`Y>*P^PP`E_+7(3*JA,19<I"N,,<^Q$RYCB(KQ=+
|
|
MB\\R_?CJ@76PR.<=I78?\RA@G\L]>4V!?8Q.K;IG3[/C/363!`N:B<<>^=`-
|
|
M[%\W7,-CB4O9;\`<N(ONR^=,"Z(7FF8&K@9=W[;<>98PP+()#+F$#%>6QR5'
|
|
M=Z6R"7<AU:[4VMVNWGQ!5YR!<GKJB8!R1D>)*$I:'2&-6"@D.+/`.7_Z#]WB
|
|
MJ.W*;(;:ALK@)R.;G_:VK()+T3&;>]BII6M&2SU6PX*VKKI[,I0^I2ZUHVFT
|
|
MBCHBA@/Z]E6IE)2#'D^B7&50VBRC`O->1-!:!%V2J$L46%ZX](-5@J2T%B9#
|
|
M+F(9B^=8MAEH:T#J>!3`3!R2-0ZAZ"D'%,^<)`KR-Y>XWB8(_>!8XM08U$&F
|
|
MNPNLU>E3%D?PFKWLQD$R>,W<RVL9S>H=L[HW(Q&,Y+KB!.*KL9N1:%O`+_[=
|
|
M&?PBUV2G'N+)QW:-9:,=>/;1LJ/MC$2)<WTW1P\J3DT45O[5S7`V^.?/LWX*
|
|
MW^H6X1_15<8&1R1:%`JPQ-&<'\6!V0LOMT_,@-)XL&W.#IQ"^<9KD$K^R%"@
|
|
MJ12C8"\<S%]'<RL(K*?Y+;#>(D1S*MPJ8@CHQW4@VEA1%``@<.,\LH([&HGR
|
|
MK0*3GVW`5"A]P1#CT"NY"_S-.A[5#/CHC@9O07EU@[N0S^77S#?]%N7E10'U
|
|
MVIS+T:0*HQ/4\BX*L/?FT(62O&$DMNC_RRGDK;%-R&3"G,F8+Y[8[%,\3+%K
|
|
M;F5/Z(8BY@4Z!]QTZ(E&]OW\WG^<KRSOB:O[U%#4D<YE!"7!@M-P41BOHRG]
|
|
M90C/20U,$G7CNKY+5WDGCV:CJ9G-:K*_F(<_O];(V_ZL]QY^AN/N#'[0J82O
|
|
M_N#="!XFXROX1ON`-7HWZ8X`Z-UD?`,MWW=_'(S>P>_X9@*8&,0`6@U&HSX6
|
|
MP,.T/YH.9H,?^_QE,L/?67_R8W?(GL;P/>5+]J\QMOV^_[-&AMW1NYON.V@T
|
|
M['<O61_#_EMH.QQ\CX7C7E<,Z*K+!G\U&-W,9X]'L/3Z,NE?R0'34G0W&
|
|
M(^P1GFXF[*'WO@N$COH_`=;1&/_C`Q#*VMY<]2>#'F\]?JN1\0C_#V%HXVO$
|
|
M!;\3_'_9QQ_HFM'/#Q<-F.<DCOA?/<]X:O+7G>X<?]#`1*9&RB.L@TNH*X*Q
|
|
M0(15H8@W\GA/`RI>!)0J:,B]]0#^51J`V?420!&WW)@A&5*HH-AM!=5J07M\
|
|
M97V@S#M+*J0%4RAXS(WX98,>Q&LSYSBUH>.IO:Z>#\,;%)EFZDC50^<7Z8F=
|
|
MDM>MK9X4A^5U6ZU#.I,J0Y?&'[-@7QL&RX49#,NR^%-\%.OMN"?1?4G7`(D1
|
|
MFZ$)5.Q:`)"Q!?0L'YBE>+@C'I'/ZR%5^RQ)??T:C5<14?/D#/#JO.G'_(ZF
|
|
MKL8CF@;HAJ8:H?^5=*<]>:Q<^$18/\5OB^?D6=1?]C,`+A2`LTK_ZGKV<^4L
|
|
M"\M91>315,X(GR?A]VVS8Z?`ST5WU0_YN_9WDJF59$\N#SJ4Y"\L."Q1K0Q?
|
|
M9OE<]/LKR>O@$"869=["!5XIHUJXHJ(Y@)^GR=SMK5,8L`BW=)OX2O/CF/3&
|
|
M8V4QFWDB-2KC'H92FW,)0X'8NH'QVM@&R+A[H=3N7+Q(M=R^=9&JW+YRD<77
|
|
MPO'BD\PRO$9^>J9R,KW`&1.Y5]SBVYOK)?>-N,ATV!?[0)\>_6`19D<`DMJ]
|
|
M/ED"]ME^64[3.K3KF'M"D@9L>,-,3C`^%9E]740]?-G_Z5F3A:!OG"ADY>]!
|
|
M_\VF295'@V*LWZ'XA!?#L**BU/`)=$B]T&&I:2E30`&(J+7@E3-0]$D%K"*P
|
|
M5C$Q%I0JGQ>/GS5QQXJ3)<782Y!U</QO]M/]&\G*7#VSI1E5-<;/"FH*W>`@
|
|
M+$!S%F.K26)WZ3(J<A,J+G(^(%5H3TERV(XI"NLJ@0-B/%8*M(SB8I\1"";C
|
|
M95)D,]*850:%^?LHH(^P$E3^7M*EXVW'TW)@LO=4#G#&SFID[ZQC$$#K5J>^
|
|
MYQRF:IA:U5`L'%9@*AEP]]1"J\T/:8D^@"0"+\]-L@5%FHQ(P1$:RO'(@T,?
|
|
MB<.%6+A98[2-+G82==BQ5]JZ$*F_Y-=?B5K;V\D+WA*D:K9QNE<YG+`8AVQQ
|
|
MQ(2/^/3TE*4_+47^$I[!!QN7DM4FY)FJ()\7!'-%$`D+_8BV_'PVY"E?)9E0
|
|
M.T<0&!"V^!IW+I\U__;?)Q<P=2SBN"]X%MG^^FS-:-]BKU1--E.E0#)8J9G-
|
|
M2OG-:AW=[%0;>X)GNE:+\X24R-;V+6$9VA(G(1B](A6>"3&6YZ*8:,QR\N("
|
|
M,!5@LEZEKB1=\X/5^(Y+YFT7R9S9]WP_^]X-BPLR2G?C@G\RI8>O[KSL5&2G
|
|
MBX%3:!B_Q\4N]>3LU9%K+^\J95Y*PEN<KN7Q8G@0"=J"@M_AKM874'#\=:PC
|
|
M2<P4_55=JU;CO8MIEL2RHPT3@\'&B]-#3T\/7`T^+L-B<C.2^7SB.GEH/5"0
|
|
MSDL_9-9OWUN(&UY"MMZL%]!<E+&SV"2#E^Q<F]*RDB_$/5Y)JW(C^O>G55F]
|
|
MG0R2/XSZS)5O5K5JLYF$+YD*FR/])7['E3%3%)_Y,'>#!0<PYK7#PFFI^\QG
|
|
M7/01VQ$OU8?2,&4Y[):KI_"[^E8(\/1=8GPXO6>Z<4]]6O?N`=S5P/4M8^[8
|
|
MQF9'KW6,/:>EK:K64F_6`@]$/KK:(<\MC%.!T1Q2+F=+*44_@IWO\8NE3!I)
|
|
M+I]&5A`==:.4Z2\5S\Y5Z!>XC'N>C#6^H(R(8?L<.\@8`=[*9P-E6%BT(H!9
|
|
MB<(25]K@&[@TT$B<J@KVFZ:T8OGQ924M[X>;_N3G.3-%>:8:6Y1$I_S?6I1C
|
|
M[([_BJ7*%PX\(X8)%O:X)1MVJ[-%PRY<AF1H94N&`VVKG6J]4]V3RM_0M49=
|
|
M^5,'\-I44RCP#(Q9"DJ4^ASU$WL/B1504%_>!W""-MZ"!EQ5!O3!\3>A>J6G
|
|
MPATVX=\A+S"T:F(#HOT;W@]-7S")?)XAD]62YS,4\EN*K!BV7X1F93$+<.>P
|
|
M&G6-!Z/F.C4F]M8**7JR/(T0&V$)D;XMP>*\OWK2J+:UAOHWC!HU`PJ2-+DX
|
|
MBSU),N$4*']QI62SQ(LRCM'F">%\B#Q7_A;<T>#IG*<&6X2_LHNFC+*R0/./
|
|
J5".V8K`1>&?LC3PZT;V_B8@X[<#CAC]SA9Y)<AHC_9+_!=<!%F+O2@``
|
|
`
|
|
end
|
|
|
|
|
|
From oleg@sai.msu.su Sun Oct 18 14:01:49 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id OAA01739
|
|
for <maillist@candle.pha.pa.us>; Sun, 18 Oct 1998 14:01:48 -0400 (EDT)
|
|
Received: from ra.sai.msu.su (ra.sai.msu.su [158.250.29.2]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id NAA23532 for <maillist@candle.pha.pa.us>; Sun, 18 Oct 1998 13:51:14 -0400 (EDT)
|
|
Received: from ra (ra [158.250.29.2])
|
|
by ra.sai.msu.su (8.9.1/8.9.1) with SMTP id UAA17832;
|
|
Sun, 18 Oct 1998 20:45:25 +0300 (MSK)
|
|
Date: Sun, 18 Oct 1998 21:45:24 +0400 (MSD)
|
|
From: Oleg Bartunov <oleg@sai.msu.su>
|
|
X-Sender: megera@ra
|
|
To: Tom Lane <tgl@sss.pgh.pa.us>
|
|
cc: Bruce Momjian <maillist@candle.pha.pa.us>, pgsql-hackers@postgreSQL.org,
|
|
jwieck@debis.com
|
|
Subject: Re: [HACKERS] SELECT ... LIMIT (trial implementation)
|
|
In-Reply-To: <2292.908726689@sss.pgh.pa.us>
|
|
Message-ID: <Pine.GSO.3.96.SK.981018213213.17519C-100000@ra>
|
|
Organization: Sternberg Astronomical Institute (Moscow University)
|
|
MIME-Version: 1.0
|
|
Content-Type: TEXT/PLAIN; charset=US-ASCII
|
|
Status: RO
|
|
|
|
On Sun, 18 Oct 1998, Tom Lane wrote:
|
|
|
|
> Date: Sun, 18 Oct 1998 12:04:49 -0400
|
|
> From: Tom Lane <tgl@sss.pgh.pa.us>
|
|
> To: Bruce Momjian <maillist@candle.pha.pa.us>
|
|
> Cc: pgsql-hackers@postgreSQL.org
|
|
> Subject: Re: [HACKERS] SELECT ... LIMIT (trial implementation)
|
|
>
|
|
> Bruce Momjian <maillist@candle.pha.pa.us> writes:
|
|
> > What if someone wants the rows from 500 to the end. Should we allow
|
|
> > the syntax to be:
|
|
> > SELECT ... [LIMIT count] [OFFSET offset]
|
|
> > LIMIT and OFFSET are independent.
|
|
>
|
|
> I like that syntax the best, but remember we are not inventing in
|
|
> a green field here. Isn't this a feature that already exists in
|
|
> other DBMs? We should probably copy their syntax, unless it's
|
|
> truly spectacularly awful...
|
|
>
|
|
> regards, tom lane
|
|
>
|
|
|
|
Mysql uses LIMIT [offset,] rows
|
|
>From documentation:
|
|
|
|
LIMIT takes one or two numeric arguments. A single argument
|
|
represents the maximum number of rows to return in a result. If two
|
|
arguments are given the first argument is the offset to the first row to
|
|
return, while the second is the maximum number of rows to return in the
|
|
result.
|
|
|
|
What would be nice if somehow total number of rows could be returned.
|
|
This is often needed for altavista-like application.
|
|
Of course, I can do
|
|
select count(*) from sometable ... LIMIT offset, rows
|
|
and then
|
|
select ... from sometable ... LIMIT offset, rows
|
|
but this seems not elegant solution.
|
|
|
|
Regards,
|
|
|
|
Oleg
|
|
_____________________________________________________________
|
|
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
|
|
Sternberg Astronomical Institute, Moscow University (Russia)
|
|
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
|
|
phone: +007(095)939-16-83, +007(095)939-23-83
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Sun Oct 18 14:31:12 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id OAA02288
|
|
for <maillist@candle.pha.pa.us>; Sun, 18 Oct 1998 14:31:10 -0400 (EDT)
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id OAA24844 for <maillist@candle.pha.pa.us>; Sun, 18 Oct 1998 14:15:35 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id OAA26655;
|
|
Sun, 18 Oct 1998 14:00:03 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Sun, 18 Oct 1998 13:58:57 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id NAA26381
|
|
for pgsql-hackers-outgoing; Sun, 18 Oct 1998 13:58:55 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from ra.sai.msu.su (ra.sai.msu.su [158.250.29.2])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id NAA26367
|
|
for <pgsql-hackers@postgreSQL.org>; Sun, 18 Oct 1998 13:58:49 -0400 (EDT)
|
|
(envelope-from oleg@sai.msu.su)
|
|
Received: from ra (ra [158.250.29.2])
|
|
by ra.sai.msu.su (8.9.1/8.9.1) with SMTP id UAA18077;
|
|
Sun, 18 Oct 1998 20:58:41 +0300 (MSK)
|
|
Date: Sun, 18 Oct 1998 21:58:41 +0400 (MSD)
|
|
From: Oleg Bartunov <oleg@sai.msu.su>
|
|
X-Sender: megera@ra
|
|
To: Jan Wieck <jwieck@debis.com>
|
|
cc: PostgreSQL HACKERS <pgsql-hackers@postgreSQL.org>
|
|
Subject: Re: [HACKERS] SELECT ... LIMIT (trial implementation)
|
|
In-Reply-To: <m0zUBum-000EBQC@orion.SAPserv.Hamburg.dsh.de>
|
|
Message-ID: <Pine.GSO.3.96.SK.981018215259.17519D-100000@ra>
|
|
Organization: Sternberg Astronomical Institute (Moscow University)
|
|
MIME-Version: 1.0
|
|
Content-Type: TEXT/PLAIN; charset=US-ASCII
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: RO
|
|
|
|
Jan,
|
|
|
|
I tested your patch on my Linux box and it works ok, except
|
|
aggregates functions doesn't work properly, for example
|
|
count(*) always produces 0
|
|
|
|
kdo=> select count(*) from work_flats limit 10,1000;
|
|
count
|
|
-----
|
|
(0 rows)
|
|
|
|
while
|
|
|
|
kdo=> select rooms from work_flats limit 10,1000;
|
|
rooms
|
|
-----
|
|
3
|
|
3
|
|
3
|
|
3
|
|
3
|
|
3
|
|
3
|
|
3
|
|
3
|
|
3
|
|
(10 rows)
|
|
|
|
|
|
Regards,
|
|
|
|
Oleg
|
|
_____________________________________________________________
|
|
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
|
|
Sternberg Astronomical Institute, Moscow University (Russia)
|
|
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
|
|
phone: +007(095)939-16-83, +007(095)939-23-83
|
|
|
|
|
|
|
|
From wieck@sapserv.debis.de Sun Oct 18 15:17:53 1998
|
|
Received: from dsh.de (firewall-user@neptun.sns-felb.debis.de [53.122.101.2])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id PAA03203
|
|
for <maillist@candle.pha.pa.us>; Sun, 18 Oct 1998 15:17:49 -0400 (EDT)
|
|
Received: by dsh.de; id VAA01180; Sun, 18 Oct 1998 21:19:50 +0200 (MET DST)
|
|
Received: from dshmail.dsh.de(53.47.15.3) by neptun.dsh.de via smap (3.2)
|
|
id xma001117; Sun, 18 Oct 98 21:19:33 +0200
|
|
Received: from mail1.hh1.dsh.de (mail1.hh1.dsh.de [53.47.9.5])
|
|
by dshmail.dsh.de (8.8.7/8.8.7) with ESMTP id VAA25465;
|
|
Sun, 18 Oct 1998 21:17:29 +0200 (MET DST)
|
|
Received: from mars.SAPserv.Hamburg.dsh.de (root@mail5.hh1.dsh.de [53.2.168.17])
|
|
by mail1.hh1.dsh.de (8.8.7/8.8.7) with SMTP id VAA14993;
|
|
Sun, 18 Oct 1998 21:19:58 +0200
|
|
Received: from orion.SAPserv.Hamburg.dsh.de
|
|
by mars.SAPserv.Hamburg.dsh.de with smtp
|
|
for <<maillist@candle.pha.pa.us>>
|
|
id m0zUvyS-000B5AC; Sun, 18 Oct 98 18:46 MET DST
|
|
Received: by orion.SAPserv.Hamburg.dsh.de
|
|
for maillist@candle.pha.pa.us
|
|
id m0zUyWO-000EBPC; Sun, 18 Oct 98 21:29 MET DST
|
|
Message-Id: <m0zUyWO-000EBPC@orion.SAPserv.Hamburg.dsh.de>
|
|
From: jwieck@debis.com (Jan Wieck)
|
|
Subject: Re: [HACKERS] SELECT ... LIMIT (trial implementation)
|
|
To: oleg@sai.msu.su (Oleg Bartunov)
|
|
Date: Sun, 18 Oct 1998 21:29:43 +0200 (MET DST)
|
|
Cc: tgl@sss.pgh.pa.us, maillist@candle.pha.pa.us, pgsql-hackers@postgreSQL.org,
|
|
jwieck@debis.com
|
|
Reply-To: jwieck@debis.com (Jan Wieck)
|
|
In-Reply-To: <Pine.GSO.3.96.SK.981018213213.17519C-100000@ra> from "Oleg Bartunov" at Oct 18, 98 09:45:24 pm
|
|
X-Mailer: ELM [version 2.4 PL25]
|
|
Content-Type: text
|
|
Status: RO
|
|
|
|
Oleg Bartunov wrote:
|
|
|
|
> On Sun, 18 Oct 1998, Tom Lane wrote:
|
|
>
|
|
> > Bruce Momjian <maillist@candle.pha.pa.us> writes:
|
|
> > > What if someone wants the rows from 500 to the end. Should we allow
|
|
> > > the syntax to be:
|
|
> > > SELECT ... [LIMIT count] [OFFSET offset]
|
|
> > > LIMIT and OFFSET are independent.
|
|
> >
|
|
> > I like that syntax the best, but remember we are not inventing in
|
|
> > a green field here. Isn't this a feature that already exists in
|
|
> > other DBMs? We should probably copy their syntax, unless it's
|
|
> > truly spectacularly awful...
|
|
> >
|
|
> > regards, tom lane
|
|
> >
|
|
>
|
|
> Mysql uses LIMIT [offset,] rows
|
|
> >From documentation:
|
|
>
|
|
> LIMIT takes one or two numeric arguments. A single argument
|
|
> represents the maximum number of rows to return in a result. If two
|
|
> arguments are given the first argument is the offset to the first row to
|
|
> return, while the second is the maximum number of rows to return in the
|
|
> result.
|
|
|
|
Simple change, just flip them in gram.y.
|
|
|
|
And for the 500 to end:
|
|
|
|
SELECT ... LIMIT 500, 0 (after flipped)
|
|
|
|
The 0 has the same meaning as ALL. And that could also be
|
|
added to the parser easily so one can say
|
|
|
|
SELECT ... LIMIT 500, ALL
|
|
|
|
too.
|
|
|
|
>
|
|
> What would be nice if somehow total number of rows could be returned.
|
|
> This is often needed for altavista-like application.
|
|
> Of course, I can do
|
|
> select count(*) from sometable ... LIMIT offset, rows
|
|
> and then
|
|
> select ... from sometable ... LIMIT offset, rows
|
|
> but this seems not elegant solution.
|
|
|
|
Absolutely makes no sense for me. As said in the other
|
|
posting, aggregates do the counting scan in a deeper level
|
|
and thus cannot get limited. So if you invoke an aggregate,
|
|
the whole scan is always done.
|
|
|
|
|
|
Jan
|
|
|
|
--
|
|
|
|
#======================================================================#
|
|
# It's easier to get forgiveness for being wrong than for being right. #
|
|
# Let's break this rule - forgive me. #
|
|
#======================================== jwieck@debis.com (Jan Wieck) #
|
|
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Sun Oct 18 19:08:47 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id TAA00573
|
|
for <maillist@candle.pha.pa.us>; Sun, 18 Oct 1998 19:08:46 -0400 (EDT)
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id QAA01305 for <maillist@candle.pha.pa.us>; Sun, 18 Oct 1998 16:14:30 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id PAA06110;
|
|
Sun, 18 Oct 1998 15:55:20 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Sun, 18 Oct 1998 15:54:07 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id PAA05771
|
|
for pgsql-hackers-outgoing; Sun, 18 Oct 1998 15:54:05 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from dsh.de (firewall-user@neptun.sns-felb.debis.de [53.122.101.2])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id PAA05753
|
|
for <hackers@postgreSQL.org>; Sun, 18 Oct 1998 15:53:52 -0400 (EDT)
|
|
(envelope-from wieck@sapserv.debis.de)
|
|
Received: by dsh.de; id VAA09240; Sun, 18 Oct 1998 21:56:10 +0200 (MET DST)
|
|
Received: from dshmail.dsh.de(53.47.15.3) by neptun.dsh.de via smap (3.2)
|
|
id xma008902; Sun, 18 Oct 98 21:55:19 +0200
|
|
Received: from mail1.hh1.dsh.de (mail1.hh1.dsh.de [53.47.9.5])
|
|
by dshmail.dsh.de (8.8.7/8.8.7) with ESMTP id VAA28158;
|
|
Sun, 18 Oct 1998 21:53:16 +0200 (MET DST)
|
|
Received: from mars.SAPserv.Hamburg.dsh.de (root@mail5.hh1.dsh.de [53.2.168.17])
|
|
by mail1.hh1.dsh.de (8.8.7/8.8.7) with SMTP id VAA15349;
|
|
Sun, 18 Oct 1998 21:55:45 +0200
|
|
Received: from orion.SAPserv.Hamburg.dsh.de
|
|
by mars.SAPserv.Hamburg.dsh.de with smtp
|
|
for <<hackers@postgreSQL.org>>
|
|
id m0zUwX6-000B5AC; Sun, 18 Oct 98 19:22 MET DST
|
|
Received: by orion.SAPserv.Hamburg.dsh.de
|
|
for hackers@postgreSQL.org
|
|
id m0zUz52-000EBPC; Sun, 18 Oct 98 22:05 MET DST
|
|
Message-Id: <m0zUz52-000EBPC@orion.SAPserv.Hamburg.dsh.de>
|
|
From: jwieck@debis.com (Jan Wieck)
|
|
Subject: Re: [HACKERS] SELECT ... LIMIT (trial implementation)
|
|
To: terry@terrym.com (Terry Mackintosh)
|
|
Date: Sun, 18 Oct 1998 22:05:31 +0200 (MET DST)
|
|
Cc: hackers@postgreSQL.org
|
|
Reply-To: jwieck@debis.com (Jan Wieck)
|
|
In-Reply-To: <Pine.LNX.3.95.981018155322.29282B-100000@terry1.acun.com> from "Terry Mackintosh" at Oct 18, 98 03:58:57 pm
|
|
X-Mailer: ELM [version 2.4 PL25]
|
|
Content-Type: text
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: RO
|
|
|
|
>
|
|
> On Sun, 18 Oct 1998, Tom Lane wrote:
|
|
>
|
|
> > Bruce Momjian <maillist@candle.pha.pa.us> writes:
|
|
> > > What if someone wants the rows from 500 to the end. Should we allow
|
|
> > > the syntax to be:
|
|
> > > SELECT ... [LIMIT count] [OFFSET offset]
|
|
> > > LIMIT and OFFSET are independent.
|
|
> >
|
|
> > I like that syntax the best, but remember we are not inventing in
|
|
> > a green field here. Isn't this a feature that already exists in
|
|
> > other DBMs? We should probably copy their syntax, unless it's
|
|
> > truly spectacularly awful...
|
|
> >
|
|
> > regards, tom lane
|
|
>
|
|
> None that I have used (VFP, M$ SQL Server) that had 'LIMIT', had 'OFFSET'.
|
|
> So it would seem that the very idea of OFFSET is to break with what others
|
|
> are doing.
|
|
>
|
|
> I too like the above syntax.
|
|
> Why mimic, when you can do better? Go for it!
|
|
>
|
|
|
|
We have a powerful parser. So we can provide
|
|
|
|
... [ LIMIT { rows | ALL } ] [ OFFSET skip ]
|
|
|
|
or
|
|
|
|
... [ LIMIT [ skip , ] { rows | ALL } ]
|
|
|
|
at the same time.
|
|
|
|
|
|
Jan
|
|
|
|
--
|
|
|
|
#======================================================================#
|
|
# It's easier to get forgiveness for being wrong than for being right. #
|
|
# Let's break this rule - forgive me. #
|
|
#======================================== jwieck@debis.com (Jan Wieck) #
|
|
|
|
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Sun Oct 18 19:08:39 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id TAA00557
|
|
for <maillist@candle.pha.pa.us>; Sun, 18 Oct 1998 19:08:37 -0400 (EDT)
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id QAA03555 for <maillist@candle.pha.pa.us>; Sun, 18 Oct 1998 16:56:03 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id QAA10374;
|
|
Sun, 18 Oct 1998 16:36:26 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Sun, 18 Oct 1998 16:35:16 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id QAA10298
|
|
for pgsql-hackers-outgoing; Sun, 18 Oct 1998 16:35:15 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from dsh.de (firewall-user@neptun.sns-felb.debis.de [53.122.101.2])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id QAA09974
|
|
for <hackers@postgreSQL.org>; Sun, 18 Oct 1998 16:32:21 -0400 (EDT)
|
|
(envelope-from wieck@sapserv.debis.de)
|
|
Received: by dsh.de; id WAA18249; Sun, 18 Oct 1998 22:34:46 +0200 (MET DST)
|
|
Received: from dshmail.dsh.de(53.47.15.3) by neptun.dsh.de via smap (3.2)
|
|
id xma018115; Sun, 18 Oct 98 22:34:11 +0200
|
|
Received: from mail1.hh1.dsh.de (mail1.hh1.dsh.de [53.47.9.5])
|
|
by dshmail.dsh.de (8.8.7/8.8.7) with ESMTP id WAA29950;
|
|
Sun, 18 Oct 1998 22:32:01 +0200 (MET DST)
|
|
Received: from mars.SAPserv.Hamburg.dsh.de (root@mail5.hh1.dsh.de [53.2.168.17])
|
|
by mail1.hh1.dsh.de (8.8.7/8.8.7) with SMTP id WAA15581;
|
|
Sun, 18 Oct 1998 22:34:28 +0200
|
|
Received: from orion.SAPserv.Hamburg.dsh.de
|
|
by mars.SAPserv.Hamburg.dsh.de with smtp
|
|
for <<jwieck@debis.com>>
|
|
id m0zUx8Z-000B5AC; Sun, 18 Oct 98 20:01 MET DST
|
|
Received: by orion.SAPserv.Hamburg.dsh.de
|
|
for jwieck@debis.com
|
|
id m0zUzgV-000EBPC; Sun, 18 Oct 98 22:44 MET DST
|
|
Message-Id: <m0zUzgV-000EBPC@orion.SAPserv.Hamburg.dsh.de>
|
|
From: jwieck@debis.com (Jan Wieck)
|
|
Subject: Re: [HACKERS] SELECT ... LIMIT (trial implementation)
|
|
To: jwieck@debis.com
|
|
Date: Sun, 18 Oct 1998 22:44:15 +0200 (MET DST)
|
|
Cc: terry@terrym.com, hackers@postgreSQL.org
|
|
Reply-To: jwieck@debis.com (Jan Wieck)
|
|
In-Reply-To: <m0zUz52-000EBPC@orion.SAPserv.Hamburg.dsh.de> from "Jan Wieck" at Oct 18, 98 10:05:31 pm
|
|
X-Mailer: ELM [version 2.4 PL25]
|
|
Content-Type: text
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: RO
|
|
|
|
> We have a powerful parser. So we can provide
|
|
> [...]
|
|
|
|
This version now accepts all of the following
|
|
|
|
... [ LIMIT rows ] [ OFFSET skip ]
|
|
... [ OFFSET skip ] [ LIMIT rows ]
|
|
... [ LIMIT [ skip , ] rows ]
|
|
|
|
rows can be a positive integer constant greater that 0, a $n
|
|
parameter (in SPI_prepare()) or the keyword ALL. 0 isn't
|
|
accepted as constant to force ALL in that case making clear
|
|
that this is wanted. In the parameter version the integer
|
|
value 0 still is used to mean ALL.
|
|
|
|
skip can be a positive integer constant greater or equal to 0
|
|
or a $n parameter for SPI_prepare.
|
|
|
|
If any of these syntaxes is used in SPI_prepare()'d plans,
|
|
the given tcount argument for SPI_execp() is ignored and the
|
|
plan or parameter values are used.
|
|
|
|
Anyone happy now?
|
|
|
|
|
|
Jan
|
|
|
|
--
|
|
|
|
#======================================================================#
|
|
# It's easier to get forgiveness for being wrong than for being right. #
|
|
# Let's break this rule - forgive me. #
|
|
#======================================== jwieck@debis.com (Jan Wieck) #
|
|
|
|
begin 644 opt_limit.diff.gz
|
|
M'XL(")%0*C8"`V]P=%]L:6UI="YD:69F`.4\:W?;-K*?U5^!:-.NY-"V*-EZ
|
|
MN?$>5:83;67)U2-MSSWWZ-`29/-&(E62BN--_=_OS``@08F4E4>;[JY/&Y'`
|
|
M8(`9#&8&@P%GSGS.#J<^"_SID><[M\<W]O0M=V?'4V^YM-U9H!Z.I@BSH_J;
|
|
M@X.#?=#D+GV']:<A,ZO,-)NGE6:ESLQ&H_[-X>'A$WTDVI:;I]7FR8EH>Y#\
|
|
MH\%4&@94TRNBQM=3!D^'WS#V-\>=+M8SSO+KT%D$Q\OI^_#H+I]2L_+\T%ZD
|
|
MUP4/P=2>WG&L?:'5!J'ON+>B#?QW?$#=ZG]0R@YRN6M";KWGTW7H^:^YO;KB
|
|
M2\]_8$&XGL]3J3)+IF&6JC%=9JD,!75%6<YQPUPN-^=VN/;Y&9;\M.;^PP4/
|
|
MIM#G;^J9:D1O;<\-^?N03<7O&="2@[(`\"R<Y=1;N^&9H"273DH.:`ELUPD?
|
|
M&'!C^C9('[E9,4RSKHW</#7,RDDT\L<=G;R@3MH^D,69C4,-0N9ZP.VY[RU9
|
|
M>,?9K?..NXR&R][9BS47C3)P'>//DB\#'A:^4V0:K&2PP/D7]^8%558L$D/4
|
|
MZU'XL.*YW$LVFA"/DI4T+H)XR3J]T4F_<Y$"L.`NU,M^8+Y.BBE`1`*`%2Y@
|
|
M)I=%.0U;8$[@KA<+@+ML=8=6"L#-`V#"\0[&5FI[X,".Y@Z(>)``0)AL.;CE
|
|
M(<V&6#?Q[,AWUUYR`9F!X#A==FI5PZR7A>P`7.;"`7XEA+J(T!+\%0]CR((8
|
|
M4)$D^QG+*5R#M5N(%HG!HI;#$"1/-3*87%^&D+>B6A_.G!5F'$3SY4O6\UQ>
|
|
MA*4(O+KJO[&0-!S+:NYS'G<!36DM-&`5-^)5_-<AL-##5790[(V[W?@M6C*?
|
|
M1_HLTP!Q.5QZN+(==\,"I=2GFZ`4P/UMT!.-RZ5FZ;1I-K*-4/7$J#9BE0>O
|
|
MM9*:Y``X[DS9.\^9,<N=72]LMX#_L(,5_&LPBZ:$'7`$Y,1HV62T7BWXR+Y9
|
|
M\.'""]F!F%M.&)*M8()CC#0)PAQ`B>T'?`038K#V<C8"G<6\%?<!OP>0+Q`2
|
|
M5!/SYG-0$-1A(!%@L;M>WG!?%K/AU'8O')]/L3&;J2<)3P06#E9@%,/YVIT6
|
|
M6:&H4R,8`"0,.!A._HX7-ND+X%^#I>$QTM5%%4R-XCN:VBUE\[%_A`6E>'-H
|
|
ML+3TE17;V@-MD6W."7(P6F#X(I82H/^`+)/SD<M%$Q+;\1P.))H[J3Q`K,Q8
|
|
MKO[:Y`H%`NI#2)96\.E,2#<9X"C5=$>I!BJVUHA4["9I/@_6"_)TP/LAC_,"
|
|
MU1GJ-"I$\<OE"@=8X+@T(I1`LIGD=`F"0$.7XC+ACE"1<FZV_O;WH!I`4D,G
|
|
MJ7%BE.LQ28+AA^<\F*Q\;\K!ML]4YWKE`DPZ+B9P4ESP#IQ9WYG1`%^D#U!X
|
|
M7Z#AA8^E;+PDV)O36[=SU1FQZ<)>!]+URD!$KA>:BT@(V#.P&6!?BECS@92/
|
|
M=$`/I@*".)J[MGU["86KK<*N$X0==^[E5O36$54T!8YR6G+!O1-.[U@!_<:1
|
|
M?1OW7Z2.9<\Y<'BX<N^:HB@WC2:W0.5@`J/69Q(F!H%^BP79Y/`\=N>*"O0&
|
|
M'-FWT;A4CT2)ZG$5HZ/RM!XEL0"CSSP63A;`D(D#'(E[(9Y'332.XQ]?>+<%
|
|
M:S#H#PR6)R`><I_-/9]!MXXR!.!SA[">F3*,I,-Y/B(,X0L.R1R37?V/\[]'
|
|
M;QUWAI-\W1JTKB:=WIM6%SQCYKQXH0;P00U$&V34\J5JV1M?L>^^TS$[5+M2
|
|
MO*8:9Q:1%7,:7QZW&;'=AQS=E^;,1I_";T_O)($63"MBON$T7V)G$V/=D#FM
|
|
M@UC@TF5NQN<VZ#LE;HD!K%W^?@4V'+0'[;%P1\.^G3$[2)"<-UC*4A)#>U0=
|
|
M(=URE-^SDB1X'W)=?@M*]ITB]C':>NRMGX3R_1SU)##LT$[QMBQ63IMEGZJ;
|
|
MY/;S2=6D;(RNF;0QQ$9(ZB5Z_62U%"'3M%*BMS]3*8G!_$5UDN3SUU!)>_'E
|
|
MHS62P+I;(25E[<_21]1K4AWIP9ND-A)#W*F,-BE-TT7@4JE%*YU;VHNC\YKA
|
|
MO36,<NDTBF'(/Z9MS61!M#/32[7-V;.XM-7M3D;CZZXU3`!?>OZ][<\2^[($
|
|
M@.[$GHD*.1_H5Y8;9:/<B/W*+S)4Z>_K12+H]H4&GL;S<JENE,V*%@4NE8V*
|
|
M%BW]=,I>;%/&OB!EJ=2485K*-8T:LV94RI6O0,T/]O3M9Y-3J1GED[)&3J5B
|
|
M5"J-+TM.*0'3[UEBO7SIN:G"WJRJ[\UJ];I1:YQHQ#P=B1'%GQN.$<6?$)-)
|
|
M):QV4C9J)YK0U4]*1OU4FZ4/`J78/S-API-+,Z&",QPX].!@'D"%W[-[]-V6
|
|
M0$S(%P_2?G$11D8>PM:<!6^=E6HF>PZ)?`8JWN>_K6'&P%S</$C'3S#N*.H)
|
|
M9G,!N*&GX,Y;+V8LA*D(/3;SF$-6<\8Y\)\M^#N^"%0SP+WR@L"Y67!6F*WQ
|
|
MF`=@9_P]"X#=105VR/YIN_(EB]CC#>]83!X[5V9).A^'AWJUM+)X4N.X:W&\
|
|
M$YV;I')VCP@KSC^*1I`58M4`GHBQ:I"?$&3-:MUHELQFJ9P=934K)<.LG&J!
|
|
M$2K84O,Y$9D6;!/>*<BK.%5282T1!R^J<`C-$,6V<(4>GI-_T$]$+=COO[--
|
|
MB/;6QF'#S]"W(^*49/A3ET4<(-_M`?IP<!DLN0O"+)P/&CL.RK7]6RDOVBH4
|
|
MQW].ECDL@0M252X(MI"N"WILW=80G/W^</33V.I.VOVKJU;OHL"#8I']@UF_
|
|
M6.W)P!J!`F5-^3;NJ<,%GP,2-Q$:Y,'A^6\S@^&O"@1&0<`2\5<*KA8&%MY'
|
|
MJ0*CC!3,UQME^M$'S6C&\/=8;<'*R5IG5/7$"B.8E+75>&)M;;=K-,NGS4HE
|
|
M>U75P*35$B:-"N)5=>-Y"Y"U0,7M.V[H#?B";!9,U=Q>!$)%3>]LGV*V()U>
|
|
M#W8KNI40$NM+Y29WUB&Y')-I=,2*S*=PH-A>J-:Z%Q[92^F'I]%4+Y\:]4I)
|
|
MLV;EFE&OGFJ:0DK)\+HSH<4ZZ5^/>S_V^C^3'`EM^V+7*?4K&7X@G2#-#]DF
|
|
MX4<]?2PM-V;"*<A4*!^D-4W%A#W07TOMD(2F`<-&9A2QAX"=>>^X[SO@WV"Y
|
|
MLE^"^RS:62J,&3T)4Z:F)G7DT;8)M*"(O^PW_)Z7"-S@^*.Q'[%Q@,D`"V]J
|
|
M+U@[2@F(J/#8:@U3(0:&H@>F?2]25%Z`+H5Z;H!>KK:8>ADE";#X+YDMD`2-
|
|
M4@84J)XWD`*ZX&Z$=2N%(`6>]MT27N82A%J\9KN!B`.(!G%60`H@918H^E1V
|
|
M02I"-)9[(!2Y!AN`"=F*CI^W](/:DN/:D89=KA5Q#DV.X\KG*-.DG^29M%SM
|
|
MI'XPPV<^XW.Q]L%XC$>P_(>CUFB8KDQ.0)F<FO&.7E,=_1\G[?%@V!_$2B/[
|
|
M$%X:'K)7E^@>A,FS]0EBG*Y]GU-`23O:23GP.5-LB/0ALJ)]=3$96EVK/<)@
|
|
M5=3WX;DZLX<NBD(CUL&_KU=CC?C%J-K4#'UY`OB5B,TVUJA%,.]L]8`.V:9?
|
|
MO%6;;K*WP+:M]DDIW6KO;@K6OMP$;S?;'3ZMU0WSM*[%/521-JTNO\=^#L_7
|
|
M+K"M+?3K2Q;RY8H"M0EKA^L.5-CJH8"^JL%D8X-I$UD\>QJR+:=:+%8I61(D
|
|
M\JF>FAAO'6;/2URY:UIBJ(^=E8R6,"FEYFDI>U+*IPVC7-42$D5![$W9JQ7T
|
|
M,J3<03PN`#7F&^)`X/#\S@Z&ZYNNX[X-P,W-A_Z:Y\'#S9./E1?I%!D8`$Z?
|
|
MX;R`G0`=.%4"9DL0Q%QF(M1W03(FFH9P2SAV(Q0^SE/XVI&V^"@'7,P>;/5F
|
|
MV;*CU>X2'@WL8Z4GJRF(C]D\V>&,BY06/7L2"VIQ?"GTH!M8OXM@!8\%L6?Y
|
|
M#KR%V_"N>$8V$&,F26$01I`<J$F:,L"R`0RY@`(7[8GW[DH7$^%:Z5WIM=M=
|
|
MO?B$KH0`9?34ECY$2D>Q*HI;[:&-R)[YQ[9K+Q[^Q3<D:K,R7:`VH5+DR4R7
|
|
MIYTM*\TR;.]J.\0)C+Q9UU-)L*!1TC?=ZOQOR!=\&@[#9=B4,0:PM\\*A;@<
|
|
M/++8N!?!:%,6(>9ZRM"'#$[$,<+0M]U@[OG+&$EA)1V&3,3JC$E@V12@C0'I
|
|
MX]$`4W$HT7@*15L[YWP4)-%V)HNX]MH/,**T'W%ZQ/1)H;N%K=G10YI$B)J=
|
|
MXB9`4F2MO%/69+/AVA7-ZJQ<;I9+S4HU6]#`13[5MMOX:FYGX4]MD!?O]AA^
|
|
M46K2T^TQ?K998T_1#SQ^;T_#S2Q\A7-U.\'M592.+WW\JW%WU/GAUY&5P+>\
|
|
M0?A[#%A@@SV2"W,YF&+8B9#*!*<77FX>R($R1&A8G'T'ZDW4()7BD5"@JQ2A
|
|
MH!<!YJW"B>W[]L/D!D1O%J`[%6P4B6#F^Y4OV]AAZ`,@2.,DM/U;'LKRC8*R
|
|
M.(D#5FA]P1"C@P)VZWOK532J$<C1+?<OP7BU_-M`\/);VKA^C_KR/(=V;2+T
|
|
M:%R%,2)NN^<Y6'L3Z$)+6#1C7_2_DH6B-;8)2"=,2,=\,F/3SYPQK;RVD3%8
|
|
M,C4U+]$YL)>'GG@XO9O<>?>3I>T^"'.?&(H^THD(+<@B<4PQ43<O8LS(FO-<
|
|
M?Q4.^6^8"A/7`./X(JJS%GR9=79>KM:,<JT2KSD*#TRN#79IC=JOX:?;;XW@
|
|
M![>9\(_5>=6#AT'_"OY%GX$:O1JT>@#T:M`?0\O7K3>=WBOX[8\'@(D@.M"J
|
|
MT^M96``/0ZLW[(PZ;RSQ,ACA[\@:O&EUZ:D/_P[%-/ZSCVU_M'XU6+?5>S5N
|
|
MO8)&7:MU07UTK4MHV^W\B(7]=DL.Z*I%@[_J],8CHN:JWQN]QH=>ZTH=Z?=:
|
|
MHTZ_ASW"TWA`#^W7+2"T9_T"6'M]_!\?@%!J.[ZR!IVV:-V_-%B_A_]W86C]
|
|
M:\0%OP/\_\+"'^B:Z*<]2<T$/M?BA?GOS&>,&_YUV9TJYHV&4=%31#J]]L"Z
|
|
MLGK$D`OK%_QY#;V-AL2I$="-K.E%K`>>M/M7UYTN]M7M`$0/68!@R`@Q''CZ
|
|
MD<AO_0(<'EO$$O74?R//QGO6ST@I#*`ULBY^B)_'0T3>Z_<LXL1KXCP\="Y_
|
|
MI5\Q&IRL?N=BB%RP!JT1\N"Z-1S^#(R`IT&_;5T0>ZFS@=7N7`.^@853@;]#
|
|
M:X0_,`4]P#'HPV`&8W5N/[1^&EN]-@#"6#HX1P".O!^^1L#AJ(4RA)$[R;WA
|
|
MZ`+E!GY@`@P,5P)K8!R$;=Q3G!KW1AU`]J;5'H^O\+?;`:`WUN"'_M"BAR%P
|
|
M4&:25&"NM&/P?_>YNKPDEO^GSEE&3`;TW:E^Q@5OIT:Y6M*<DQR3GKWF8K#[
|
|
M.^YS^2*A=&//[NQWCGN;!*"]M0+07!ZQH6`IGD!.VSOE])T#[HF7]EM.$9*X
|
|
M0NTB<CF7MO*_K7$7_[R<<1Q<+9615LT_P#<H*I<31^<N!J"0GB@P\+R^T9,6
|
|
M-'C>T.N0SKC*+)WI297/39/.M#IQSF>4=(F--T($X5VA9``2,]H*QE#1]AZ`
|
|
MS`V@QSBI47%[9T?B=#&KAT3MHR+U^7/M?,)5'!#56>S'C,!:28\)UDSPSQ)I
|
|
M0[^SUK`=)3!\8-1/_OO\&7N4]1=6"L"Y!G!\8%U=CWX].$[#<GP@,R\/CIG@
|
|
MDXR];(IC,R=.XE+\/?9WX^]LVS6D8S[1U4)$_@KJMX=+]GFY"/^<%,]DQ[^S
|
|
M1`^ZDRFT4X:ON;N3DR+U]'0G3V&BTZ0-7-GC^N1NMKGR&<0C+I&P$&$[/F`D
|
|
M#C).)IL#^%ER_K='WLQUZ,!-Q73$$A`G]DF-1&7%C2SOYR;[GIGI"<:B,SR^
|
|
M$<?5RW5`B;>M;I=Y/K,Q[<K!#%P\Q^6PB1)YQYC^DM?[<65>>\H=;*TVXP*V
|
|
M!K%Q^_JYN0F0<N]:J]VZ=)UHN7GC.E&Y>=TZ3;?(`)283V#1QTS('\BETE^7
|
|
M291'GV23N*^19!.5Z612X$$DX\?I_IOU=),N(2)43)O<+>[N&&9RX>GK_#-6
|
|
MWG_ABOC+3[;P">3'09Z.];[E#_>>/PO23QCBVITQWQCLH^.^6M-D[)?.&78<
|
|
M,H`S8Y;C_(@/>8K?Y>4&Z=%0A>!+.V&0CW=,<97+_7P4*X#B0U$,PPKS*EH0
|
|
M0P?<%68BGP@K:``AMV?Y:$,65X3<!P'-QX$'K<H3Q?U'0WZW1)"E7+0O0=:3
|
|
MXW^QF^[/)"MU]LIUPZSH.014<*+1O8!>85>0CR(P"ON"S\.\",=$1<Y;I`IC
|
|
M,XH<,O5Y&:F)X8`8-Z]VQ%&Q1P3B]C@NFA)I%.')(N*T:IA5,Y$(`045C0C4
|
|
M6GD5SE&XW?42)F^:C\([JL*;0UG_4I$07804/EH$Y<QPWG$O'14M</SM\0"V
|
|
MRS$<$MI'(K.5@,_O08RX^KW@<\?=/&S,@$E7"!G`*6JAFJX6]D$`K>O-TQU)
|
|
M*A6S;%3,Q&66<O(RRQVW<3OM!;S`WW$WG/@RMS1*%<H=JVQZZ6T[+GOG\'OF
|
|
MB`3J8+W"HT@^V\JYIYR@W=G<GY3)G>A5#2?07-1C$=B#$1\='=%-AKF\BH!Y
|
|
MF?YZP2/?%P\X9@S3F1$)G8O)MB)U+1"W-PKJ;MP$06!`V.);%#_!->_F_P[/
|
|
M@75T'+OK9#&<>JOC%=&^(5Z)FG2A2H"DB%(M792RFYTT2^7=)XLEXR1*9=>.
|
|
M_38_&Z;._62:"![ML0.1\]]726-X9Y"NUT0%8,Z!6<\2WRBY%EEGT4<O4C]_
|
|
MH80S_<-?'_TA#CHT)4JW#TV_,J5/?\OCR[(B_4;#J5$VS3_B2R]Z6M&S/>=>
|
|
M?;PD]2LE^%FGA>V*8GB0=RTE!7_`QUL^@8+]O\^R)XFIJK]2,BJ5:.UBKC:S
|
|
MI^&:U*"_=J.;7D='3WPK;+_DT\&XIZZ<R._+!39%$N9>0)>F+7<F/_DB=>MX
|
|
M-8/FLHP2U>++>&SK.RI&6F:J_+"7HE7[1-H?3ZLV>UO?Z?K3J$^=^5K%J-1J
|
|
M\0$7F;`)TE\0'[V*LON+4?ZIB-KB&<66"">U[J/@N.PC\B.^5!]:PXT$YLUR
|
|
M/45QV]Y*!9[\N!@^'-V1;=Q1G[2].P"W+?#IAC.W;^-RLW32-'>DDM4K1EW_
|
|
MU!;(0.CA=C@0]]KB.VWR6P3R:VU*2_'WL$EQQ9>F2!MI-_'\<*]/3)']TO%L
|
|
M?1OM"WR=ZRP>:_3%,D0,RV??048(\#-]-%#"0C$;'[@2!@5AM&%CL^"^P:+;
|
|
M5."_&5HKNDY4U&XL_#2V!K].R!452?PT*;%-^<^:E'W\CG^+J<I6#B)=F!0+
|
|
M/6[HANWJ=-6P#9>B&>KIFN&)MI5FY;19V7$KMUHRJMJ=7'RMZ?FEF`Q$GH)V
|
|
M?'B&]HG>`V;[',R7^Q8V06MWQGUA*GW^SO'6@7X[_T!>M1/[.[HIB&CUK$]$
|
|
M^S?\%%#RKGCHB?3AM)8BV3.7W5*F#--ZD9:5`BZPG<-JM#4NC%K8U(C8&SO@
|
|
MN),5=RRP$98PM;=E6)SU&=1JI6%4]8\:5T],*(CO$$07+>,,7$&!]@G6PI2R
|
|
M4HLXQJGXH*X8HKC.>0/;4?_A3-R:LIEXI<^Y$&5%B>8?B48T8[`01&?TQNZ=
|
|
>\,Y;ATP>0^,Y\-><H4<6'Y.K?<G_`Z+L!%(`6P``
|
|
`
|
|
end
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Mon Oct 19 07:31:10 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id HAA05591
|
|
for <maillist@candle.pha.pa.us>; Mon, 19 Oct 1998 07:31:09 -0400 (EDT)
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id HAA13574 for <maillist@candle.pha.pa.us>; Mon, 19 Oct 1998 07:12:57 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id GAA13957;
|
|
Mon, 19 Oct 1998 06:25:09 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Mon, 19 Oct 1998 06:22:35 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id GAA13581
|
|
for pgsql-hackers-outgoing; Mon, 19 Oct 1998 06:22:33 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from dsh.de (firewall-user@neptun.sns-felb.debis.de [53.122.101.2])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id GAA13566
|
|
for <pgsql-hackers@postgreSQL.org>; Mon, 19 Oct 1998 06:22:27 -0400 (EDT)
|
|
(envelope-from wieck@sapserv.debis.de)
|
|
Received: by dsh.de; id MAA13918; Mon, 19 Oct 1998 12:21:16 +0200 (MET DST)
|
|
Received: from dshmail.dsh.de(53.47.15.3) by neptun.dsh.de via smap (3.2)
|
|
id xma013635; Mon, 19 Oct 98 12:20:55 +0200
|
|
Received: from mail1.hh1.dsh.de (mail1.hh1.dsh.de [53.47.9.5])
|
|
by dshmail.dsh.de (8.8.7/8.8.7) with ESMTP id MAA11037;
|
|
Mon, 19 Oct 1998 12:18:27 +0200 (MET DST)
|
|
Received: from mars.SAPserv.Hamburg.dsh.de (root@mail5.hh1.dsh.de [53.2.168.17])
|
|
by mail1.hh1.dsh.de (8.8.7/8.8.7) with SMTP id MAA29382;
|
|
Mon, 19 Oct 1998 12:20:49 +0200
|
|
Received: from orion.SAPserv.Hamburg.dsh.de
|
|
by mars.SAPserv.Hamburg.dsh.de with smtp
|
|
for <<maillist@candle.pha.pa.us>>
|
|
id m0zVA2V-000B5AC; Mon, 19 Oct 98 09:47 MET DST
|
|
Received: by orion.SAPserv.Hamburg.dsh.de
|
|
for maillist@candle.pha.pa.us
|
|
id m0zVCaT-000EBPC; Mon, 19 Oct 98 12:30 MET DST
|
|
Message-Id: <m0zVCaT-000EBPC@orion.SAPserv.Hamburg.dsh.de>
|
|
From: jwieck@debis.com (Jan Wieck)
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
To: Inoue@tpf.co.jp (Hiroshi Inoue)
|
|
Date: Mon, 19 Oct 1998 12:30:52 +0200 (MET DST)
|
|
Cc: jwieck@debis.com, pgsql-hackers@postgreSQL.org, maillist@candle.pha.pa.us
|
|
Reply-To: jwieck@debis.com (Jan Wieck)
|
|
In-Reply-To: <002801bdfb46$39ad8ec0$2801007e@cadzone.tpf.co.jp> from "Hiroshi Inoue" at Oct 19, 98 06:52:46 pm
|
|
X-Mailer: ELM [version 2.4 PL25]
|
|
Content-Type: text
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: RO
|
|
|
|
Hiroshi Inoue wrote:
|
|
|
|
> When using cursors,in most cases the response to get first(next) rows
|
|
> is necessary for me,not the throughput.
|
|
> How can we tell PostgreSQL optimzer that the response is necessary ?
|
|
|
|
With my LIMIT patch, the offset and the row count are part of
|
|
the querytree. And if a LIMIT is given, the limitCount elemet
|
|
of the querytree (a Node *) isn't NULL what it is by default.
|
|
|
|
When a LIMIT is given, the optimizer could assume that first
|
|
rows is wanted (even if the limit is ALL maybe - but I have
|
|
to think about this some more). And this assumption might let
|
|
it decide to use an index to resolve an ORDER BY even if no
|
|
qualification was given.
|
|
|
|
Telling the optimizer that first rows wanted in a cursor
|
|
operation would read
|
|
|
|
DECLARE CURSOR c FOR SELECT * FROM mytab ORDER BY a LIMIT ALL;
|
|
|
|
|
|
Jan
|
|
|
|
--
|
|
|
|
#======================================================================#
|
|
# It's easier to get forgiveness for being wrong than for being right. #
|
|
# Let's break this rule - forgive me. #
|
|
#======================================== jwieck@debis.com (Jan Wieck) #
|
|
|
|
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Tue Oct 20 06:01:49 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id GAA02483
|
|
for <maillist@candle.pha.pa.us>; Tue, 20 Oct 1998 06:01:48 -0400 (EDT)
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id FAA07799 for <maillist@candle.pha.pa.us>; Tue, 20 Oct 1998 05:51:19 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id FAA00108;
|
|
Tue, 20 Oct 1998 05:17:58 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Tue, 20 Oct 1998 05:16:37 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id FAA29953
|
|
for pgsql-hackers-outgoing; Tue, 20 Oct 1998 05:16:35 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from dsh.de (firewall-user@neptun.sns-felb.debis.de [53.122.101.2])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id FAA29939
|
|
for <hackers@postgreSQL.org>; Tue, 20 Oct 1998 05:16:27 -0400 (EDT)
|
|
(envelope-from wieck@sapserv.debis.de)
|
|
Received: by dsh.de; id LAA04585; Tue, 20 Oct 1998 11:15:05 +0200 (MET DST)
|
|
Received: from dshmail.dsh.de(53.47.15.3) by neptun.dsh.de via smap (3.2)
|
|
id xma004337; Tue, 20 Oct 98 11:14:46 +0200
|
|
Received: from mail1.hh1.dsh.de (mail1.hh1.dsh.de [53.47.9.5])
|
|
by dshmail.dsh.de (8.8.7/8.8.7) with ESMTP id LAA14628;
|
|
Tue, 20 Oct 1998 11:12:27 +0200 (MET DST)
|
|
Received: from mars.SAPserv.Hamburg.dsh.de (root@mail5.hh1.dsh.de [53.2.168.17])
|
|
by mail1.hh1.dsh.de (8.8.7/8.8.7) with SMTP id LAA03564;
|
|
Tue, 20 Oct 1998 11:14:52 +0200
|
|
Received: from orion.SAPserv.Hamburg.dsh.de
|
|
by mars.SAPserv.Hamburg.dsh.de with smtp
|
|
for <<maillist@candle.pha.pa.us>>
|
|
id m0zVVUa-000B5AC; Tue, 20 Oct 98 08:42 MET DST
|
|
Received: by orion.SAPserv.Hamburg.dsh.de
|
|
for maillist@candle.pha.pa.us
|
|
id m0zVY2c-000EBPC; Tue, 20 Oct 98 11:25 MET DST
|
|
Message-Id: <m0zVY2c-000EBPC@orion.SAPserv.Hamburg.dsh.de>
|
|
From: jwieck@debis.com (Jan Wieck)
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
To: Inoue@tpf.co.jp (Hiroshi Inoue)
|
|
Date: Tue, 20 Oct 1998 11:25:22 +0200 (MET DST)
|
|
Cc: maillist@candle.pha.pa.us, jwieck@debis.com, hackers@postgreSQL.org
|
|
Reply-To: jwieck@debis.com (Jan Wieck)
|
|
In-Reply-To: <000601bdfc03$02e67100$2801007e@cadzone.tpf.co.jp> from "Hiroshi Inoue" at Oct 20, 98 05:24:09 pm
|
|
X-Mailer: ELM [version 2.4 PL25]
|
|
Content-Type: text
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: RO
|
|
|
|
Hiroshi Inoue wrote:
|
|
|
|
> > * Prevent psort() usage when query already using index matching ORDER BY
|
|
> >
|
|
> >
|
|
>
|
|
> I can't find the reference to descending order cases except my posting.
|
|
> If we use an index scan to remove sorts in those cases,backward positioning
|
|
> and scanning are necessary.
|
|
|
|
I think it's only thought as a reminder that the optimizer
|
|
needs some optimization.
|
|
|
|
That topic, and the LIMIT stuff too I think, is past 6.4 work
|
|
and may go into a 6.4.1 performance release. So when we are
|
|
after 6.4, we have enough time to work out a real solution,
|
|
instead of just throwing in a patch as a quick shot.
|
|
|
|
What we two did where steps in the same direction. Your one
|
|
covers more situations, but after all if multiple people have
|
|
the same idea there is a good chance that it is the right
|
|
thing to do.
|
|
|
|
>
|
|
> Let t be a table with 2 indices, index1(key1,key2), index2(key1,key3).
|
|
> i.e. key1 is common to index1 and index2.
|
|
>
|
|
> And for the query
|
|
> select * from t where key1>....;
|
|
>
|
|
> If PosgreSQL optimizer choose [ index scan on index1 ] we can't remove
|
|
> sorts from the following query.
|
|
> select * from t where key1>... order by key1,key3;
|
|
>
|
|
> Similarly if [ index scan on index2 ] are chosen we can't remove sorts
|
|
> from the following query.
|
|
> select * from t where key1>... order by key1,key2;
|
|
>
|
|
> But in both cases (clever) optimizer can choose another index for scan.
|
|
|
|
Right. As I remember, your solution does basically the same
|
|
as my one. It does not change the optimizers decision about
|
|
the index or if an index at all is used. So I assume they
|
|
hook into the same position where depending on the order by
|
|
clause the sort node is added. And that is at the very end of
|
|
the optimizer.
|
|
|
|
What you describe above requires changes in upper levels of
|
|
optimization. Doing that is far away from my knowledge about
|
|
the optimizer. And some of your earlier statements let me
|
|
think you aren't familiar enough with it too. We need at
|
|
least help from others to do it well.
|
|
|
|
I don't want to dive that deep into the optimizer. There was
|
|
a far too long time where the rule system was broken and got
|
|
out of sync with the parser/optimizer capabilities. I fixed
|
|
many things in it for 6.4. My first priority now is, not to
|
|
let such a situation come up again.
|
|
|
|
|
|
Jan
|
|
|
|
--
|
|
|
|
#======================================================================#
|
|
# It's easier to get forgiveness for being wrong than for being right. #
|
|
# Let's break this rule - forgive me. #
|
|
#======================================== jwieck@debis.com (Jan Wieck) #
|
|
|
|
|
|
|
|
|
|
From wieck@sapserv.debis.de Tue Oct 20 13:00:04 1998
|
|
Received: from dsh.de (firewall-user@neptun.sns-felb.debis.de [53.122.101.2])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id NAA08269
|
|
for <maillist@candle.pha.pa.us>; Tue, 20 Oct 1998 13:00:01 -0400 (EDT)
|
|
Received: by dsh.de; id TAA14203; Tue, 20 Oct 1998 19:02:15 +0200 (MET DST)
|
|
Received: from dshmail.dsh.de(53.47.15.3) by neptun.dsh.de via smap (3.2)
|
|
id xma014037; Tue, 20 Oct 98 19:01:39 +0200
|
|
Received: from mail1.hh1.dsh.de (mail1.hh1.dsh.de [53.47.9.5])
|
|
by dshmail.dsh.de (8.8.7/8.8.7) with ESMTP id SAA24445;
|
|
Tue, 20 Oct 1998 18:59:16 +0200 (MET DST)
|
|
Received: from mars.SAPserv.Hamburg.dsh.de (root@mail5.hh1.dsh.de [53.2.168.17])
|
|
by mail1.hh1.dsh.de (8.8.7/8.8.7) with SMTP id TAA06159;
|
|
Tue, 20 Oct 1998 19:01:40 +0200
|
|
Received: from orion.SAPserv.Hamburg.dsh.de
|
|
by mars.SAPserv.Hamburg.dsh.de with smtp
|
|
for <<maillist@candle.pha.pa.us>>
|
|
id m0zVcmS-000B5AC; Tue, 20 Oct 98 16:29 MET DST
|
|
Received: by orion.SAPserv.Hamburg.dsh.de
|
|
for maillist@candle.pha.pa.us
|
|
id m0zVfKV-000EBPC; Tue, 20 Oct 98 19:12 MET DST
|
|
Message-Id: <m0zVfKV-000EBPC@orion.SAPserv.Hamburg.dsh.de>
|
|
From: jwieck@debis.com (Jan Wieck)
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
To: maillist@candle.pha.pa.us (Bruce Momjian)
|
|
Date: Tue, 20 Oct 1998 19:12:19 +0200 (MET DST)
|
|
Cc: jwieck@debis.com, Inoue@tpf.co.jp, hackers@postgreSQL.org
|
|
Reply-To: jwieck@debis.com (Jan Wieck)
|
|
In-Reply-To: <199810201645.MAA07946@candle.pha.pa.us> from "Bruce Momjian" at Oct 20, 98 12:45:49 pm
|
|
X-Mailer: ELM [version 2.4 PL25]
|
|
Content-Type: text
|
|
Status: ROr
|
|
|
|
>
|
|
> I agree. Another good thing is that the LIMIT thing will not require a
|
|
> dump/reload, so it is a good candidate for a minor release.
|
|
|
|
That's wrong, sorry.
|
|
|
|
The limit thing as I implemented it adds 2 new variables to
|
|
the Query structure. Rewrite rules are stored as querytrees
|
|
and in the existing pg_rewrite entries that would be missing.
|
|
|
|
|
|
Jan
|
|
|
|
--
|
|
|
|
#======================================================================#
|
|
# It's easier to get forgiveness for being wrong than for being right. #
|
|
# Let's break this rule - forgive me. #
|
|
#======================================== jwieck@debis.com (Jan Wieck) #
|
|
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Tue Oct 20 13:24:47 1998
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id NAA08484
|
|
for <maillist@candle.pha.pa.us>; Tue, 20 Oct 1998 13:24:45 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id NAA01878;
|
|
Tue, 20 Oct 1998 13:00:06 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Tue, 20 Oct 1998 12:59:59 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id MAA01579
|
|
for pgsql-hackers-outgoing; Tue, 20 Oct 1998 12:59:58 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from dsh.de (firewall-user@neptun.sns-felb.debis.de [53.122.101.2])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id MAA01557
|
|
for <hackers@postgreSQL.org>; Tue, 20 Oct 1998 12:59:52 -0400 (EDT)
|
|
(envelope-from wieck@sapserv.debis.de)
|
|
Received: by dsh.de; id TAA14203; Tue, 20 Oct 1998 19:02:15 +0200 (MET DST)
|
|
Received: from dshmail.dsh.de(53.47.15.3) by neptun.dsh.de via smap (3.2)
|
|
id xma014037; Tue, 20 Oct 98 19:01:39 +0200
|
|
Received: from mail1.hh1.dsh.de (mail1.hh1.dsh.de [53.47.9.5])
|
|
by dshmail.dsh.de (8.8.7/8.8.7) with ESMTP id SAA24445;
|
|
Tue, 20 Oct 1998 18:59:16 +0200 (MET DST)
|
|
Received: from mars.SAPserv.Hamburg.dsh.de (root@mail5.hh1.dsh.de [53.2.168.17])
|
|
by mail1.hh1.dsh.de (8.8.7/8.8.7) with SMTP id TAA06159;
|
|
Tue, 20 Oct 1998 19:01:40 +0200
|
|
Received: from orion.SAPserv.Hamburg.dsh.de
|
|
by mars.SAPserv.Hamburg.dsh.de with smtp
|
|
for <<maillist@candle.pha.pa.us>>
|
|
id m0zVcmS-000B5AC; Tue, 20 Oct 98 16:29 MET DST
|
|
Received: by orion.SAPserv.Hamburg.dsh.de
|
|
for maillist@candle.pha.pa.us
|
|
id m0zVfKV-000EBPC; Tue, 20 Oct 98 19:12 MET DST
|
|
Message-Id: <m0zVfKV-000EBPC@orion.SAPserv.Hamburg.dsh.de>
|
|
From: jwieck@debis.com (Jan Wieck)
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
To: maillist@candle.pha.pa.us (Bruce Momjian)
|
|
Date: Tue, 20 Oct 1998 19:12:19 +0200 (MET DST)
|
|
Cc: jwieck@debis.com, Inoue@tpf.co.jp, hackers@postgreSQL.org
|
|
Reply-To: jwieck@debis.com (Jan Wieck)
|
|
In-Reply-To: <199810201645.MAA07946@candle.pha.pa.us> from "Bruce Momjian" at Oct 20, 98 12:45:49 pm
|
|
X-Mailer: ELM [version 2.4 PL25]
|
|
Content-Type: text
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: ROr
|
|
|
|
>
|
|
> I agree. Another good thing is that the LIMIT thing will not require a
|
|
> dump/reload, so it is a good candidate for a minor release.
|
|
|
|
That's wrong, sorry.
|
|
|
|
The limit thing as I implemented it adds 2 new variables to
|
|
the Query structure. Rewrite rules are stored as querytrees
|
|
and in the existing pg_rewrite entries that would be missing.
|
|
|
|
|
|
Jan
|
|
|
|
--
|
|
|
|
#======================================================================#
|
|
# It's easier to get forgiveness for being wrong than for being right. #
|
|
# Let's break this rule - forgive me. #
|
|
#======================================== jwieck@debis.com (Jan Wieck) #
|
|
|
|
|
|
|
|
|
|
From wieck@sapserv.debis.de Tue Oct 20 13:10:22 1998
|
|
Received: from dsh.de (firewall-user@neptun.sns-felb.debis.de [53.122.101.2])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id NAA08339
|
|
for <maillist@candle.pha.pa.us>; Tue, 20 Oct 1998 13:10:18 -0400 (EDT)
|
|
Received: by dsh.de; id TAA17171; Tue, 20 Oct 1998 19:12:30 +0200 (MET DST)
|
|
Received: from dshmail.dsh.de(53.47.15.3) by neptun.dsh.de via smap (3.2)
|
|
id xma017064; Tue, 20 Oct 98 19:12:00 +0200
|
|
Received: from mail1.hh1.dsh.de (mail1.hh1.dsh.de [53.47.9.5])
|
|
by dshmail.dsh.de (8.8.7/8.8.7) with ESMTP id TAA24806;
|
|
Tue, 20 Oct 1998 19:09:37 +0200 (MET DST)
|
|
Received: from mars.SAPserv.Hamburg.dsh.de (root@mail5.hh1.dsh.de [53.2.168.17])
|
|
by mail1.hh1.dsh.de (8.8.7/8.8.7) with SMTP id TAA06212;
|
|
Tue, 20 Oct 1998 19:12:01 +0200
|
|
Received: from orion.SAPserv.Hamburg.dsh.de
|
|
by mars.SAPserv.Hamburg.dsh.de with smtp
|
|
for <<maillist@candle.pha.pa.us>>
|
|
id m0zVcwS-000B5AC; Tue, 20 Oct 98 16:39 MET DST
|
|
Received: by orion.SAPserv.Hamburg.dsh.de
|
|
for maillist@candle.pha.pa.us
|
|
id m0zVfUW-000EBPC; Tue, 20 Oct 98 19:22 MET DST
|
|
Message-Id: <m0zVfUW-000EBPC@orion.SAPserv.Hamburg.dsh.de>
|
|
From: jwieck@debis.com (Jan Wieck)
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
To: maillist@candle.pha.pa.us (Bruce Momjian)
|
|
Date: Tue, 20 Oct 1998 19:22:40 +0200 (MET DST)
|
|
Cc: jwieck@debis.com, Inoue@tpf.co.jp, hackers@postgreSQL.org
|
|
Reply-To: jwieck@debis.com (Jan Wieck)
|
|
In-Reply-To: <199810201702.NAA08286@candle.pha.pa.us> from "Bruce Momjian" at Oct 20, 98 01:02:58 pm
|
|
X-Mailer: ELM [version 2.4 PL25]
|
|
Content-Type: text
|
|
Status: RO
|
|
|
|
>
|
|
> > >
|
|
> > > I agree. Another good thing is that the LIMIT thing will not require a
|
|
> > > dump/reload, so it is a good candidate for a minor release.
|
|
> >
|
|
> > That's wrong, sorry.
|
|
> >
|
|
> > The limit thing as I implemented it adds 2 new variables to
|
|
> > the Query structure. Rewrite rules are stored as querytrees
|
|
> > and in the existing pg_rewrite entries that would be missing.
|
|
>
|
|
> Oh, sorry. I forgot. That could be tough.
|
|
|
|
But it wouldn't hurt to add them now to have them in
|
|
place. The required out-, read- and copyfuncs are in
|
|
my patch too. This would prevent dump/load when we
|
|
later add the real LIMIT functionality. And it does
|
|
not change anything now.
|
|
|
|
|
|
Jan
|
|
|
|
--
|
|
|
|
#======================================================================#
|
|
# It's easier to get forgiveness for being wrong than for being right. #
|
|
# Let's break this rule - forgive me. #
|
|
#======================================== jwieck@debis.com (Jan Wieck) #
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Tue Oct 20 14:57:36 1998
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id OAA11449
|
|
for <maillist@candle.pha.pa.us>; Tue, 20 Oct 1998 14:57:34 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id NAA03547;
|
|
Tue, 20 Oct 1998 13:10:38 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Tue, 20 Oct 1998 13:10:23 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id NAA03488
|
|
for pgsql-hackers-outgoing; Tue, 20 Oct 1998 13:10:21 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from dsh.de (firewall-user@neptun.sns-felb.debis.de [53.122.101.2])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id NAA03455
|
|
for <hackers@postgreSQL.org>; Tue, 20 Oct 1998 13:10:10 -0400 (EDT)
|
|
(envelope-from wieck@sapserv.debis.de)
|
|
Received: by dsh.de; id TAA17171; Tue, 20 Oct 1998 19:12:30 +0200 (MET DST)
|
|
Received: from dshmail.dsh.de(53.47.15.3) by neptun.dsh.de via smap (3.2)
|
|
id xma017064; Tue, 20 Oct 98 19:12:00 +0200
|
|
Received: from mail1.hh1.dsh.de (mail1.hh1.dsh.de [53.47.9.5])
|
|
by dshmail.dsh.de (8.8.7/8.8.7) with ESMTP id TAA24806;
|
|
Tue, 20 Oct 1998 19:09:37 +0200 (MET DST)
|
|
Received: from mars.SAPserv.Hamburg.dsh.de (root@mail5.hh1.dsh.de [53.2.168.17])
|
|
by mail1.hh1.dsh.de (8.8.7/8.8.7) with SMTP id TAA06212;
|
|
Tue, 20 Oct 1998 19:12:01 +0200
|
|
Received: from orion.SAPserv.Hamburg.dsh.de
|
|
by mars.SAPserv.Hamburg.dsh.de with smtp
|
|
for <<maillist@candle.pha.pa.us>>
|
|
id m0zVcwS-000B5AC; Tue, 20 Oct 98 16:39 MET DST
|
|
Received: by orion.SAPserv.Hamburg.dsh.de
|
|
for maillist@candle.pha.pa.us
|
|
id m0zVfUW-000EBPC; Tue, 20 Oct 98 19:22 MET DST
|
|
Message-Id: <m0zVfUW-000EBPC@orion.SAPserv.Hamburg.dsh.de>
|
|
From: jwieck@debis.com (Jan Wieck)
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
To: maillist@candle.pha.pa.us (Bruce Momjian)
|
|
Date: Tue, 20 Oct 1998 19:22:40 +0200 (MET DST)
|
|
Cc: jwieck@debis.com, Inoue@tpf.co.jp, hackers@postgreSQL.org
|
|
Reply-To: jwieck@debis.com (Jan Wieck)
|
|
In-Reply-To: <199810201702.NAA08286@candle.pha.pa.us> from "Bruce Momjian" at Oct 20, 98 01:02:58 pm
|
|
X-Mailer: ELM [version 2.4 PL25]
|
|
Content-Type: text
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: ROr
|
|
|
|
>
|
|
> > >
|
|
> > > I agree. Another good thing is that the LIMIT thing will not require a
|
|
> > > dump/reload, so it is a good candidate for a minor release.
|
|
> >
|
|
> > That's wrong, sorry.
|
|
> >
|
|
> > The limit thing as I implemented it adds 2 new variables to
|
|
> > the Query structure. Rewrite rules are stored as querytrees
|
|
> > and in the existing pg_rewrite entries that would be missing.
|
|
>
|
|
> Oh, sorry. I forgot. That could be tough.
|
|
|
|
But it wouldn't hurt to add them now to have them in
|
|
place. The required out-, read- and copyfuncs are in
|
|
my patch too. This would prevent dump/load when we
|
|
later add the real LIMIT functionality. And it does
|
|
not change anything now.
|
|
|
|
|
|
Jan
|
|
|
|
--
|
|
|
|
#======================================================================#
|
|
# It's easier to get forgiveness for being wrong than for being right. #
|
|
# Let's break this rule - forgive me. #
|
|
#======================================== jwieck@debis.com (Jan Wieck) #
|
|
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Wed Oct 21 02:35:54 1998
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id CAA29494
|
|
for <maillist@candle.pha.pa.us>; Wed, 21 Oct 1998 02:35:53 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) with SMTP id CAA13326;
|
|
Wed, 21 Oct 1998 02:10:42 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Wed, 21 Oct 1998 02:09:35 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.8.8/8.8.8) id CAA12900
|
|
for pgsql-hackers-outgoing; Wed, 21 Oct 1998 02:09:33 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from candle.pha.pa.us (maillist@s5-03.ppp.op.net [209.152.195.67])
|
|
by hub.org (8.8.8/8.8.8) with ESMTP id CAA12871
|
|
for <hackers@postgreSQL.org>; Wed, 21 Oct 1998 02:09:26 -0400 (EDT)
|
|
(envelope-from maillist@candle.pha.pa.us)
|
|
Received: (from maillist@localhost)
|
|
by candle.pha.pa.us (8.9.0/8.9.0) id CAA27774;
|
|
Wed, 21 Oct 1998 02:09:27 -0400 (EDT)
|
|
From: Bruce Momjian <maillist@candle.pha.pa.us>
|
|
Message-Id: <199810210609.CAA27774@candle.pha.pa.us>
|
|
Subject: Re: [HACKERS] What about LIMIT in SELECT ?
|
|
In-Reply-To: <m0zVfUW-000EBPC@orion.SAPserv.Hamburg.dsh.de> from Jan Wieck at "Oct 20, 1998 7:22:40 pm"
|
|
To: jwieck@debis.com
|
|
Date: Wed, 21 Oct 1998 02:09:26 -0400 (EDT)
|
|
Cc: jwieck@debis.com, Inoue@tpf.co.jp, hackers@postgreSQL.org
|
|
X-Mailer: ELM [version 2.4ME+ PL47 (25)]
|
|
MIME-Version: 1.0
|
|
Content-Type: text/plain; charset=US-ASCII
|
|
Content-Transfer-Encoding: 7bit
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: RO
|
|
|
|
> >
|
|
> > > >
|
|
> > > > I agree. Another good thing is that the LIMIT thing will not require a
|
|
> > > > dump/reload, so it is a good candidate for a minor release.
|
|
> > >
|
|
> > > That's wrong, sorry.
|
|
> > >
|
|
> > > The limit thing as I implemented it adds 2 new variables to
|
|
> > > the Query structure. Rewrite rules are stored as querytrees
|
|
> > > and in the existing pg_rewrite entries that would be missing.
|
|
> >
|
|
> > Oh, sorry. I forgot. That could be tough.
|
|
>
|
|
> But it wouldn't hurt to add them now to have them in
|
|
> place. The required out-, read- and copyfuncs are in
|
|
> my patch too. This would prevent dump/load when we
|
|
> later add the real LIMIT functionality. And it does
|
|
> not change anything now.
|
|
>
|
|
|
|
Jan, we found that I am having to require an initdb for the INET/CIDR
|
|
type, so if you want stuff to change the views/rules for the limit
|
|
addition post 6.4, please send them in and I will apply them.
|
|
|
|
You clearly have the syntax down, so I think you should go ahead.
|
|
|
|
|
|
--
|
|
Bruce Momjian | http://www.op.net/~candle
|
|
maillist@candle.pha.pa.us | (610) 853-3000
|
|
+ If your life is a hard drive, | 830 Blythe Avenue
|
|
+ Christ can be your backup. | Drexel Hill, Pennsylvania 19026
|
|
|
|
|
|
From wieck@sapserv.debis.de Thu Oct 22 10:20:58 1998
|
|
Received: from dsh.de (firewall-user@neptun.sns-felb.debis.de [53.122.101.2])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id KAA20566
|
|
for <maillist@candle.pha.pa.us>; Thu, 22 Oct 1998 10:20:54 -0400 (EDT)
|
|
Received: by dsh.de; id QAA09067; Thu, 22 Oct 1998 16:23:14 +0200 (MET DST)
|
|
Received: from dshmail.dsh.de(53.47.15.3) by neptun.dsh.de via smap (3.2)
|
|
id xma008719; Thu, 22 Oct 98 16:22:40 +0200
|
|
Received: from mail1.hh1.dsh.de (mail1.hh1.dsh.de [53.47.9.5])
|
|
by dshmail.dsh.de (8.8.7/8.8.7) with ESMTP id QAA01558;
|
|
Thu, 22 Oct 1998 16:19:55 +0200 (MET DST)
|
|
Received: from mars.SAPserv.Hamburg.dsh.de (root@mail5.hh1.dsh.de [53.2.168.17])
|
|
by mail1.hh1.dsh.de (8.8.7/8.8.7) with SMTP id QAA18978;
|
|
Thu, 22 Oct 1998 16:22:20 +0200
|
|
Received: from orion.SAPserv.Hamburg.dsh.de
|
|
by mars.SAPserv.Hamburg.dsh.de with smtp
|
|
for <<maillist@candle.pha.pa.us>>
|
|
id m0zWJG2-000B5AC; Thu, 22 Oct 98 13:50 MET DST
|
|
Received: by orion.SAPserv.Hamburg.dsh.de
|
|
for maillist@candle.pha.pa.us
|
|
id m0zWLoE-000EBPC; Thu, 22 Oct 98 16:33 MET DST
|
|
Message-Id: <m0zWLoE-000EBPC@orion.SAPserv.Hamburg.dsh.de>
|
|
From: jwieck@debis.com (Jan Wieck)
|
|
Subject: Re: [HACKERS] psql's help (the LIMIT stuff)
|
|
To: maillist@candle.pha.pa.us (Bruce Momjian)
|
|
Date: Thu, 22 Oct 1998 16:33:50 +0200 (MET DST)
|
|
Cc: jwieck@debis.com, jose@sferacarta.com, pgsql-hackers@postgreSQL.org
|
|
Reply-To: jwieck@debis.com (Jan Wieck)
|
|
In-Reply-To: <199810221351.JAA19663@candle.pha.pa.us> from "Bruce Momjian" at Oct 22, 98 09:51:19 am
|
|
X-Mailer: ELM [version 2.4 PL25]
|
|
Content-Type: text
|
|
Status: ROr
|
|
|
|
> >
|
|
> > I hope the QUERY_LIMIT too.
|
|
>
|
|
> I still have that cnfify() possible fix to review for KQSO. Are you
|
|
> still thinking limit for 6.4 final, or a minor release after that?
|
|
|
|
I posted the part that is the minimum applied to 6.4 to make
|
|
adding LIMIT later non-initdb earlier. Anyway, here it's
|
|
again.
|
|
|
|
My LIMIT implementation that does it like the SET in the
|
|
toplevel executor (but via parsetree values) is ready for
|
|
production. I only held it back because it's feature, not
|
|
bugfix.
|
|
|
|
Do you want it in 6.4 final?
|
|
|
|
|
|
Jan
|
|
|
|
--
|
|
|
|
#======================================================================#
|
|
# It's easier to get forgiveness for being wrong than for being right. #
|
|
# Let's break this rule - forgive me. #
|
|
#======================================== jwieck@debis.com (Jan Wieck) #
|
|
|
|
|
|
diff -cr src.orig/backend/nodes/copyfuncs.c src/backend/nodes/copyfuncs.c
|
|
*** src.orig/backend/nodes/copyfuncs.c Fri Oct 16 11:53:40 1998
|
|
--- src/backend/nodes/copyfuncs.c Fri Oct 16 13:32:35 1998
|
|
***************
|
|
*** 1578,1583 ****
|
|
--- 1578,1586 ----
|
|
newnode->unionClause = temp_list;
|
|
}
|
|
|
|
+ Node_Copy(from, newnode, limitOffset);
|
|
+ Node_Copy(from, newnode, limitCount);
|
|
+
|
|
return newnode;
|
|
}
|
|
|
|
diff -cr src.orig/backend/nodes/outfuncs.c src/backend/nodes/outfuncs.c
|
|
*** src.orig/backend/nodes/outfuncs.c Fri Oct 16 11:53:40 1998
|
|
--- src/backend/nodes/outfuncs.c Fri Oct 16 13:30:50 1998
|
|
***************
|
|
*** 259,264 ****
|
|
--- 259,268 ----
|
|
appendStringInfo(str, (node->hasSubLinks ? "true" : "false"));
|
|
appendStringInfo(str, " :unionClause ");
|
|
_outNode(str, node->unionClause);
|
|
+ appendStringInfo(str, " :limitOffset ");
|
|
+ _outNode(str, node->limitOffset);
|
|
+ appendStringInfo(str, " :limitCount ");
|
|
+ _outNode(str, node->limitCount);
|
|
}
|
|
|
|
static void
|
|
diff -cr src.orig/backend/nodes/readfuncs.c src/backend/nodes/readfuncs.c
|
|
*** src.orig/backend/nodes/readfuncs.c Fri Oct 16 11:53:40 1998
|
|
--- src/backend/nodes/readfuncs.c Fri Oct 16 13:31:43 1998
|
|
***************
|
|
*** 163,168 ****
|
|
--- 163,174 ----
|
|
token = lsptok(NULL, &length); /* skip :unionClause */
|
|
local_node->unionClause = nodeRead(true);
|
|
|
|
+ token = lsptok(NULL, &length); /* skip :limitOffset */
|
|
+ local_node->limitOffset = nodeRead(true);
|
|
+
|
|
+ token = lsptok(NULL, &length); /* skip :limitCount */
|
|
+ local_node->limitCount = nodeRead(true);
|
|
+
|
|
return local_node;
|
|
}
|
|
|
|
diff -cr src.orig/include/nodes/parsenodes.h src/include/nodes/parsenodes.h
|
|
*** src.orig/include/nodes/parsenodes.h Fri Oct 16 11:53:58 1998
|
|
--- src/include/nodes/parsenodes.h Fri Oct 16 13:35:32 1998
|
|
***************
|
|
*** 60,65 ****
|
|
--- 60,67 ----
|
|
|
|
List *unionClause; /* unions are linked under the previous
|
|
* query */
|
|
+ Node *limitOffset; /* # of result tuples to skip */
|
|
+ Node *limitCount; /* # of result tuples to return */
|
|
|
|
/* internal to planner */
|
|
List *base_rel_list; /* base relation list */
|
|
***************
|
|
*** 639,644 ****
|
|
--- 641,648 ----
|
|
char *portalname; /* the portal (cursor) to create */
|
|
bool binary; /* a binary (internal) portal? */
|
|
bool unionall; /* union without unique sort */
|
|
+ Node *limitOffset; /* # of result tuples to skip */
|
|
+ Node *limitCount; /* # of result tuples to return */
|
|
} SelectStmt;
|
|
|
|
|
|
|
|
From owner-pgsql-hackers@hub.org Thu Oct 22 11:33:41 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id LAA01724
|
|
for <maillist@candle.pha.pa.us>; Thu, 22 Oct 1998 11:33:31 -0400 (EDT)
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id LAA12702 for <maillist@candle.pha.pa.us>; Thu, 22 Oct 1998 11:25:02 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.9.1/8.8.8) with SMTP id KAA11023;
|
|
Thu, 22 Oct 1998 10:22:13 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Thu, 22 Oct 1998 10:21:07 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.9.1/8.8.8) id KAA10873
|
|
for pgsql-hackers-outgoing; Thu, 22 Oct 1998 10:21:05 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from dsh.de (firewall-user@neptun.sns-felb.debis.de [53.122.101.2])
|
|
by hub.org (8.9.1/8.8.8) with ESMTP id KAA10847
|
|
for <pgsql-hackers@postgreSQL.org>; Thu, 22 Oct 1998 10:21:00 -0400 (EDT)
|
|
(envelope-from wieck@sapserv.debis.de)
|
|
Received: by dsh.de; id QAA09067; Thu, 22 Oct 1998 16:23:14 +0200 (MET DST)
|
|
Received: from dshmail.dsh.de(53.47.15.3) by neptun.dsh.de via smap (3.2)
|
|
id xma008719; Thu, 22 Oct 98 16:22:40 +0200
|
|
Received: from mail1.hh1.dsh.de (mail1.hh1.dsh.de [53.47.9.5])
|
|
by dshmail.dsh.de (8.8.7/8.8.7) with ESMTP id QAA01558;
|
|
Thu, 22 Oct 1998 16:19:55 +0200 (MET DST)
|
|
Received: from mars.SAPserv.Hamburg.dsh.de (root@mail5.hh1.dsh.de [53.2.168.17])
|
|
by mail1.hh1.dsh.de (8.8.7/8.8.7) with SMTP id QAA18978;
|
|
Thu, 22 Oct 1998 16:22:20 +0200
|
|
Received: from orion.SAPserv.Hamburg.dsh.de
|
|
by mars.SAPserv.Hamburg.dsh.de with smtp
|
|
for <<maillist@candle.pha.pa.us>>
|
|
id m0zWJG2-000B5AC; Thu, 22 Oct 98 13:50 MET DST
|
|
Received: by orion.SAPserv.Hamburg.dsh.de
|
|
for maillist@candle.pha.pa.us
|
|
id m0zWLoE-000EBPC; Thu, 22 Oct 98 16:33 MET DST
|
|
Message-Id: <m0zWLoE-000EBPC@orion.SAPserv.Hamburg.dsh.de>
|
|
From: jwieck@debis.com (Jan Wieck)
|
|
Subject: Re: [HACKERS] psql's help (the LIMIT stuff)
|
|
To: maillist@candle.pha.pa.us (Bruce Momjian)
|
|
Date: Thu, 22 Oct 1998 16:33:50 +0200 (MET DST)
|
|
Cc: jwieck@debis.com, jose@sferacarta.com, pgsql-hackers@postgreSQL.org
|
|
Reply-To: jwieck@debis.com (Jan Wieck)
|
|
In-Reply-To: <199810221351.JAA19663@candle.pha.pa.us> from "Bruce Momjian" at Oct 22, 98 09:51:19 am
|
|
X-Mailer: ELM [version 2.4 PL25]
|
|
Content-Type: text
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: RO
|
|
|
|
> >
|
|
> > I hope the QUERY_LIMIT too.
|
|
>
|
|
> I still have that cnfify() possible fix to review for KQSO. Are you
|
|
> still thinking limit for 6.4 final, or a minor release after that?
|
|
|
|
I posted the part that is the minimum applied to 6.4 to make
|
|
adding LIMIT later non-initdb earlier. Anyway, here it's
|
|
again.
|
|
|
|
My LIMIT implementation that does it like the SET in the
|
|
toplevel executor (but via parsetree values) is ready for
|
|
production. I only held it back because it's feature, not
|
|
bugfix.
|
|
|
|
Do you want it in 6.4 final?
|
|
|
|
|
|
Jan
|
|
|
|
--
|
|
|
|
#======================================================================#
|
|
# It's easier to get forgiveness for being wrong than for being right. #
|
|
# Let's break this rule - forgive me. #
|
|
#======================================== jwieck@debis.com (Jan Wieck) #
|
|
|
|
|
|
diff -cr src.orig/backend/nodes/copyfuncs.c src/backend/nodes/copyfuncs.c
|
|
*** src.orig/backend/nodes/copyfuncs.c Fri Oct 16 11:53:40 1998
|
|
--- src/backend/nodes/copyfuncs.c Fri Oct 16 13:32:35 1998
|
|
***************
|
|
*** 1578,1583 ****
|
|
--- 1578,1586 ----
|
|
newnode->unionClause = temp_list;
|
|
}
|
|
|
|
+ Node_Copy(from, newnode, limitOffset);
|
|
+ Node_Copy(from, newnode, limitCount);
|
|
+
|
|
return newnode;
|
|
}
|
|
|
|
diff -cr src.orig/backend/nodes/outfuncs.c src/backend/nodes/outfuncs.c
|
|
*** src.orig/backend/nodes/outfuncs.c Fri Oct 16 11:53:40 1998
|
|
--- src/backend/nodes/outfuncs.c Fri Oct 16 13:30:50 1998
|
|
***************
|
|
*** 259,264 ****
|
|
--- 259,268 ----
|
|
appendStringInfo(str, (node->hasSubLinks ? "true" : "false"));
|
|
appendStringInfo(str, " :unionClause ");
|
|
_outNode(str, node->unionClause);
|
|
+ appendStringInfo(str, " :limitOffset ");
|
|
+ _outNode(str, node->limitOffset);
|
|
+ appendStringInfo(str, " :limitCount ");
|
|
+ _outNode(str, node->limitCount);
|
|
}
|
|
|
|
static void
|
|
diff -cr src.orig/backend/nodes/readfuncs.c src/backend/nodes/readfuncs.c
|
|
*** src.orig/backend/nodes/readfuncs.c Fri Oct 16 11:53:40 1998
|
|
--- src/backend/nodes/readfuncs.c Fri Oct 16 13:31:43 1998
|
|
***************
|
|
*** 163,168 ****
|
|
--- 163,174 ----
|
|
token = lsptok(NULL, &length); /* skip :unionClause */
|
|
local_node->unionClause = nodeRead(true);
|
|
|
|
+ token = lsptok(NULL, &length); /* skip :limitOffset */
|
|
+ local_node->limitOffset = nodeRead(true);
|
|
+
|
|
+ token = lsptok(NULL, &length); /* skip :limitCount */
|
|
+ local_node->limitCount = nodeRead(true);
|
|
+
|
|
return local_node;
|
|
}
|
|
|
|
diff -cr src.orig/include/nodes/parsenodes.h src/include/nodes/parsenodes.h
|
|
*** src.orig/include/nodes/parsenodes.h Fri Oct 16 11:53:58 1998
|
|
--- src/include/nodes/parsenodes.h Fri Oct 16 13:35:32 1998
|
|
***************
|
|
*** 60,65 ****
|
|
--- 60,67 ----
|
|
|
|
List *unionClause; /* unions are linked under the previous
|
|
* query */
|
|
+ Node *limitOffset; /* # of result tuples to skip */
|
|
+ Node *limitCount; /* # of result tuples to return */
|
|
|
|
/* internal to planner */
|
|
List *base_rel_list; /* base relation list */
|
|
***************
|
|
*** 639,644 ****
|
|
--- 641,648 ----
|
|
char *portalname; /* the portal (cursor) to create */
|
|
bool binary; /* a binary (internal) portal? */
|
|
bool unionall; /* union without unique sort */
|
|
+ Node *limitOffset; /* # of result tuples to skip */
|
|
+ Node *limitCount; /* # of result tuples to return */
|
|
} SelectStmt;
|
|
|
|
|
|
|
|
|
|
From wieck@sapserv.debis.de Thu Oct 22 11:01:05 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id LAA21185
|
|
for <maillist@candle.pha.pa.us>; Thu, 22 Oct 1998 11:01:00 -0400 (EDT)
|
|
Received: from dsh.de (firewall-user@neptun.sns-felb.debis.de [53.122.101.2]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id KAA09646 for <maillist@candle.pha.pa.us>; Thu, 22 Oct 1998 10:44:36 -0400 (EDT)
|
|
Received: by dsh.de; id QAA19394; Thu, 22 Oct 1998 16:43:42 +0200 (MET DST)
|
|
Received: from dshmail.dsh.de(53.47.15.3) by neptun.dsh.de via smap (3.2)
|
|
id xma017268; Thu, 22 Oct 98 16:39:44 +0200
|
|
Received: from mail1.hh1.dsh.de (mail1.hh1.dsh.de [53.47.9.5])
|
|
by dshmail.dsh.de (8.8.7/8.8.7) with ESMTP id QAA02988;
|
|
Thu, 22 Oct 1998 16:36:46 +0200 (MET DST)
|
|
Received: from mars.SAPserv.Hamburg.dsh.de (root@mail5.hh1.dsh.de [53.2.168.17])
|
|
by mail1.hh1.dsh.de (8.8.7/8.8.7) with SMTP id QAA19155;
|
|
Thu, 22 Oct 1998 16:39:10 +0200
|
|
Received: from orion.SAPserv.Hamburg.dsh.de
|
|
by mars.SAPserv.Hamburg.dsh.de with smtp
|
|
for <<maillist@candle.pha.pa.us>>
|
|
id m0zWJWL-000B5DC; Thu, 22 Oct 98 14:07 MET DST
|
|
Received: by orion.SAPserv.Hamburg.dsh.de
|
|
for maillist@candle.pha.pa.us
|
|
id m0zWM4W-000EBPC; Thu, 22 Oct 98 16:50 MET DST
|
|
Message-Id: <m0zWM4W-000EBPC@orion.SAPserv.Hamburg.dsh.de>
|
|
From: jwieck@debis.com (Jan Wieck)
|
|
Subject: Re: [HACKERS] psql's help (the LIMIT stuff)
|
|
To: maillist@candle.pha.pa.us (Bruce Momjian)
|
|
Date: Thu, 22 Oct 1998 16:50:40 +0200 (MET DST)
|
|
Cc: jwieck@debis.com, jose@sferacarta.com, pgsql-hackers@postgreSQL.org
|
|
Reply-To: jwieck@debis.com (Jan Wieck)
|
|
In-Reply-To: <199810221424.KAA20601@candle.pha.pa.us> from "Bruce Momjian" at Oct 22, 98 10:24:08 am
|
|
X-Mailer: ELM [version 2.4 PL25]
|
|
Content-Type: text
|
|
Status: RO
|
|
|
|
>
|
|
> > > >
|
|
> > > > I hope the QUERY_LIMIT too.
|
|
> > >
|
|
> > > I still have that cnfify() possible fix to review for KQSO. Are you
|
|
> > > still thinking limit for 6.4 final, or a minor release after that?
|
|
> >
|
|
> > I posted the part that is the minimum applied to 6.4 to make
|
|
> > adding LIMIT later non-initdb earlier. Anyway, here it's
|
|
> > again.
|
|
>
|
|
> Already applied. I assume it is the same as the one I applied.
|
|
|
|
Seen, thanks. Your 'Applied' just arrived after I packed it
|
|
again. It's the same.
|
|
|
|
> We are close to final, and can easily put it in 6.4.1, which I am sure
|
|
> we will need, and if we split CVS trees, you'll have lots of minor
|
|
> versions to pick from. :-)
|
|
>
|
|
> Seems like it would be a nice minor release item, but the problem is
|
|
> that minor releases aren't tested as much as major ones. How confident
|
|
> are you in the code? What do others thing?
|
|
|
|
I regression tested it, and did additional tests in the
|
|
SPI/PL area. It works. It only touches the parser and the
|
|
executor. Rules, planner/optimizer just bypass the values in
|
|
the parsetree. The parser and the executor are parts of
|
|
Postgres I feel very familiar with (not so in the optimizer).
|
|
I trust in the code and would use it in a production
|
|
environment.
|
|
|
|
It's below.
|
|
|
|
|
|
Jan
|
|
|
|
--
|
|
|
|
#======================================================================#
|
|
# It's easier to get forgiveness for being wrong than for being right. #
|
|
# Let's break this rule - forgive me. #
|
|
#======================================== jwieck@debis.com (Jan Wieck) #
|
|
|
|
|
|
diff -cr src.orig/backend/commands/command.c src/backend/commands/command.c
|
|
*** src.orig/backend/commands/command.c Fri Oct 16 11:53:38 1998
|
|
--- src/backend/commands/command.c Fri Oct 16 12:56:44 1998
|
|
***************
|
|
*** 39,44 ****
|
|
--- 39,45 ----
|
|
#include "utils/mcxt.h"
|
|
#include "utils/portal.h"
|
|
#include "utils/syscache.h"
|
|
+ #include "string.h"
|
|
|
|
/* ----------------
|
|
* PortalExecutorHeapMemory stuff
|
|
***************
|
|
*** 101,106 ****
|
|
--- 102,108 ----
|
|
int feature;
|
|
QueryDesc *queryDesc;
|
|
MemoryContext context;
|
|
+ Const limcount;
|
|
|
|
/* ----------------
|
|
* sanity checks
|
|
***************
|
|
*** 113,118 ****
|
|
--- 115,134 ----
|
|
}
|
|
|
|
/* ----------------
|
|
+ * Create a const node from the given count value
|
|
+ * ----------------
|
|
+ */
|
|
+ memset(&limcount, 0, sizeof(limcount));
|
|
+ limcount.type = T_Const;
|
|
+ limcount.consttype = INT4OID;
|
|
+ limcount.constlen = sizeof(int4);
|
|
+ limcount.constvalue = (Datum)count;
|
|
+ limcount.constisnull = FALSE;
|
|
+ limcount.constbyval = TRUE;
|
|
+ limcount.constisset = FALSE;
|
|
+ limcount.constiscast = FALSE;
|
|
+
|
|
+ /* ----------------
|
|
* get the portal from the portal name
|
|
* ----------------
|
|
*/
|
|
***************
|
|
*** 176,182 ****
|
|
PortalExecutorHeapMemory = (MemoryContext)
|
|
PortalGetHeapMemory(portal);
|
|
|
|
! ExecutorRun(queryDesc, PortalGetState(portal), feature, count);
|
|
|
|
if (dest == None) /* MOVE */
|
|
pfree(queryDesc);
|
|
--- 192,198 ----
|
|
PortalExecutorHeapMemory = (MemoryContext)
|
|
PortalGetHeapMemory(portal);
|
|
|
|
! ExecutorRun(queryDesc, PortalGetState(portal), feature, (Node *)NULL, (Node *)&limcount);
|
|
|
|
if (dest == None) /* MOVE */
|
|
pfree(queryDesc);
|
|
diff -cr src.orig/backend/executor/execMain.c src/backend/executor/execMain.c
|
|
*** src.orig/backend/executor/execMain.c Fri Oct 16 11:53:38 1998
|
|
--- src/backend/executor/execMain.c Fri Oct 16 20:05:19 1998
|
|
***************
|
|
*** 64,69 ****
|
|
--- 64,70 ----
|
|
static void EndPlan(Plan *plan, EState *estate);
|
|
static TupleTableSlot *ExecutePlan(EState *estate, Plan *plan,
|
|
Query *parseTree, CmdType operation,
|
|
+ int offsetTuples,
|
|
int numberTuples, ScanDirection direction,
|
|
void (*printfunc) ());
|
|
static void ExecRetrieve(TupleTableSlot *slot, void (*printfunc) (),
|
|
***************
|
|
*** 163,169 ****
|
|
* ----------------------------------------------------------------
|
|
*/
|
|
TupleTableSlot *
|
|
! ExecutorRun(QueryDesc *queryDesc, EState *estate, int feature, int count)
|
|
{
|
|
CmdType operation;
|
|
Query *parseTree;
|
|
--- 164,170 ----
|
|
* ----------------------------------------------------------------
|
|
*/
|
|
TupleTableSlot *
|
|
! ExecutorRun(QueryDesc *queryDesc, EState *estate, int feature, Node *limoffset, Node *limcount)
|
|
{
|
|
CmdType operation;
|
|
Query *parseTree;
|
|
***************
|
|
*** 171,176 ****
|
|
--- 172,179 ----
|
|
TupleTableSlot *result;
|
|
CommandDest dest;
|
|
void (*destination) ();
|
|
+ int offset = 0;
|
|
+ int count = 0;
|
|
|
|
/******************
|
|
* sanity checks
|
|
***************
|
|
*** 191,196 ****
|
|
--- 194,289 ----
|
|
estate->es_processed = 0;
|
|
estate->es_lastoid = InvalidOid;
|
|
|
|
+ /******************
|
|
+ * if given get the offset of the LIMIT clause
|
|
+ ******************
|
|
+ */
|
|
+ if (limoffset != NULL)
|
|
+ {
|
|
+ Const *coffset;
|
|
+ Param *poffset;
|
|
+ ParamListInfo paramLI;
|
|
+ int i;
|
|
+
|
|
+ switch (nodeTag(limoffset))
|
|
+ {
|
|
+ case T_Const:
|
|
+ coffset = (Const *)limoffset;
|
|
+ offset = (int)(coffset->constvalue);
|
|
+ break;
|
|
+
|
|
+ case T_Param:
|
|
+ poffset = (Param *)limoffset;
|
|
+ paramLI = estate->es_param_list_info;
|
|
+
|
|
+ if (paramLI == NULL)
|
|
+ elog(ERROR, "parameter for limit offset not in executor state");
|
|
+ for (i = 0; paramLI[i].kind != PARAM_INVALID; i++)
|
|
+ {
|
|
+ if (paramLI[i].kind == PARAM_NUM && paramLI[i].id == poffset->paramid)
|
|
+ break;
|
|
+ }
|
|
+ if (paramLI[i].kind == PARAM_INVALID)
|
|
+ elog(ERROR, "parameter for limit offset not in executor state");
|
|
+ if (paramLI[i].isnull)
|
|
+ elog(ERROR, "limit offset cannot be NULL value");
|
|
+ offset = (int)(paramLI[i].value);
|
|
+
|
|
+ break;
|
|
+
|
|
+ default:
|
|
+ elog(ERROR, "unexpected node type %d as limit offset", nodeTag(limoffset));
|
|
+ }
|
|
+
|
|
+ if (offset < 0)
|
|
+ elog(ERROR, "limit offset cannot be negative");
|
|
+ }
|
|
+
|
|
+ /******************
|
|
+ * if given get the count of the LIMIT clause
|
|
+ ******************
|
|
+ */
|
|
+ if (limcount != NULL)
|
|
+ {
|
|
+ Const *ccount;
|
|
+ Param *pcount;
|
|
+ ParamListInfo paramLI;
|
|
+ int i;
|
|
+
|
|
+ switch (nodeTag(limcount))
|
|
+ {
|
|
+ case T_Const:
|
|
+ ccount = (Const *)limcount;
|
|
+ count = (int)(ccount->constvalue);
|
|
+ break;
|
|
+
|
|
+ case T_Param:
|
|
+ pcount = (Param *)limcount;
|
|
+ paramLI = estate->es_param_list_info;
|
|
+
|
|
+ if (paramLI == NULL)
|
|
+ elog(ERROR, "parameter for limit count not in executor state");
|
|
+ for (i = 0; paramLI[i].kind != PARAM_INVALID; i++)
|
|
+ {
|
|
+ if (paramLI[i].kind == PARAM_NUM && paramLI[i].id == pcount->paramid)
|
|
+ break;
|
|
+ }
|
|
+ if (paramLI[i].kind == PARAM_INVALID)
|
|
+ elog(ERROR, "parameter for limit count not in executor state");
|
|
+ if (paramLI[i].isnull)
|
|
+ elog(ERROR, "limit count cannot be NULL value");
|
|
+ count = (int)(paramLI[i].value);
|
|
+
|
|
+ break;
|
|
+
|
|
+ default:
|
|
+ elog(ERROR, "unexpected node type %d as limit count", nodeTag(limcount));
|
|
+ }
|
|
+
|
|
+ if (count < 0)
|
|
+ elog(ERROR, "limit count cannot be negative");
|
|
+ }
|
|
+
|
|
switch (feature)
|
|
{
|
|
|
|
***************
|
|
*** 199,205 ****
|
|
plan,
|
|
parseTree,
|
|
operation,
|
|
! ALL_TUPLES,
|
|
ForwardScanDirection,
|
|
destination);
|
|
break;
|
|
--- 292,299 ----
|
|
plan,
|
|
parseTree,
|
|
operation,
|
|
! offset,
|
|
! count,
|
|
ForwardScanDirection,
|
|
destination);
|
|
break;
|
|
***************
|
|
*** 208,213 ****
|
|
--- 302,308 ----
|
|
plan,
|
|
parseTree,
|
|
operation,
|
|
+ offset,
|
|
count,
|
|
ForwardScanDirection,
|
|
destination);
|
|
***************
|
|
*** 222,227 ****
|
|
--- 317,323 ----
|
|
plan,
|
|
parseTree,
|
|
operation,
|
|
+ offset,
|
|
count,
|
|
BackwardScanDirection,
|
|
destination);
|
|
***************
|
|
*** 237,242 ****
|
|
--- 333,339 ----
|
|
plan,
|
|
parseTree,
|
|
operation,
|
|
+ 0,
|
|
ONE_TUPLE,
|
|
ForwardScanDirection,
|
|
destination);
|
|
***************
|
|
*** 691,696 ****
|
|
--- 788,794 ----
|
|
Plan *plan,
|
|
Query *parseTree,
|
|
CmdType operation,
|
|
+ int offsetTuples,
|
|
int numberTuples,
|
|
ScanDirection direction,
|
|
void (*printfunc) ())
|
|
***************
|
|
*** 742,747 ****
|
|
--- 840,859 ----
|
|
{
|
|
result = NULL;
|
|
break;
|
|
+ }
|
|
+
|
|
+ /******************
|
|
+ * For now we completely execute the plan and skip
|
|
+ * result tuples if requested by LIMIT offset.
|
|
+ * Finally we should try to do it in deeper levels
|
|
+ * if possible (during index scan)
|
|
+ * - Jan
|
|
+ ******************
|
|
+ */
|
|
+ if (offsetTuples > 0)
|
|
+ {
|
|
+ --offsetTuples;
|
|
+ continue;
|
|
}
|
|
|
|
/******************
|
|
diff -cr src.orig/backend/executor/functions.c src/backend/executor/functions.c
|
|
*** src.orig/backend/executor/functions.c Fri Oct 16 11:53:38 1998
|
|
--- src/backend/executor/functions.c Fri Oct 16 19:01:02 1998
|
|
***************
|
|
*** 130,135 ****
|
|
--- 130,138 ----
|
|
None);
|
|
estate = CreateExecutorState();
|
|
|
|
+ if (queryTree->limitOffset != NULL || queryTree->limitCount != NULL)
|
|
+ elog(ERROR, "LIMIT clause from SQL functions not yet implemented");
|
|
+
|
|
if (nargs > 0)
|
|
{
|
|
int i;
|
|
***************
|
|
*** 200,206 ****
|
|
|
|
feature = (LAST_POSTQUEL_COMMAND(es)) ? EXEC_RETONE : EXEC_RUN;
|
|
|
|
! return ExecutorRun(es->qd, es->estate, feature, 0);
|
|
}
|
|
|
|
static void
|
|
--- 203,209 ----
|
|
|
|
feature = (LAST_POSTQUEL_COMMAND(es)) ? EXEC_RETONE : EXEC_RUN;
|
|
|
|
! return ExecutorRun(es->qd, es->estate, feature, (Node *)NULL, (Node *)NULL);
|
|
}
|
|
|
|
static void
|
|
diff -cr src.orig/backend/executor/spi.c src/backend/executor/spi.c
|
|
*** src.orig/backend/executor/spi.c Fri Oct 16 11:53:39 1998
|
|
--- src/backend/executor/spi.c Fri Oct 16 19:25:33 1998
|
|
***************
|
|
*** 791,796 ****
|
|
--- 791,798 ----
|
|
bool isRetrieveIntoRelation = false;
|
|
char *intoName = NULL;
|
|
int res;
|
|
+ Const tcount_const;
|
|
+ Node *count = NULL;
|
|
|
|
switch (operation)
|
|
{
|
|
***************
|
|
*** 825,830 ****
|
|
--- 827,865 ----
|
|
return SPI_ERROR_OPUNKNOWN;
|
|
}
|
|
|
|
+ /* ----------------
|
|
+ * Get the query LIMIT tuple count
|
|
+ * ----------------
|
|
+ */
|
|
+ if (parseTree->limitCount != NULL)
|
|
+ {
|
|
+ /* ----------------
|
|
+ * A limit clause in the parsetree overrides the
|
|
+ * tcount parameter
|
|
+ * ----------------
|
|
+ */
|
|
+ count = parseTree->limitCount;
|
|
+ }
|
|
+ else
|
|
+ {
|
|
+ /* ----------------
|
|
+ * No LIMIT clause in parsetree. Use a local Const node
|
|
+ * to put tcount into it
|
|
+ * ----------------
|
|
+ */
|
|
+ memset(&tcount_const, 0, sizeof(tcount_const));
|
|
+ tcount_const.type = T_Const;
|
|
+ tcount_const.consttype = INT4OID;
|
|
+ tcount_const.constlen = sizeof(int4);
|
|
+ tcount_const.constvalue = (Datum)tcount;
|
|
+ tcount_const.constisnull = FALSE;
|
|
+ tcount_const.constbyval = TRUE;
|
|
+ tcount_const.constisset = FALSE;
|
|
+ tcount_const.constiscast = FALSE;
|
|
+
|
|
+ count = (Node *)&tcount_const;
|
|
+ }
|
|
+
|
|
if (state == NULL) /* plan preparation */
|
|
return res;
|
|
#ifdef SPI_EXECUTOR_STATS
|
|
***************
|
|
*** 845,851 ****
|
|
return SPI_OK_CURSOR;
|
|
}
|
|
|
|
! ExecutorRun(queryDesc, state, EXEC_FOR, tcount);
|
|
|
|
_SPI_current->processed = state->es_processed;
|
|
if (operation == CMD_SELECT && queryDesc->dest == SPI)
|
|
--- 880,886 ----
|
|
return SPI_OK_CURSOR;
|
|
}
|
|
|
|
! ExecutorRun(queryDesc, state, EXEC_FOR, parseTree->limitOffset, count);
|
|
|
|
_SPI_current->processed = state->es_processed;
|
|
if (operation == CMD_SELECT && queryDesc->dest == SPI)
|
|
diff -cr src.orig/backend/parser/analyze.c src/backend/parser/analyze.c
|
|
*** src.orig/backend/parser/analyze.c Fri Oct 16 11:53:41 1998
|
|
--- src/backend/parser/analyze.c Fri Oct 16 13:29:27 1998
|
|
***************
|
|
*** 180,186 ****
|
|
--- 180,190 ----
|
|
|
|
case T_SelectStmt:
|
|
if (!((SelectStmt *) parseTree)->portalname)
|
|
+ {
|
|
result = transformSelectStmt(pstate, (SelectStmt *) parseTree);
|
|
+ result->limitOffset = ((SelectStmt *)parseTree)->limitOffset;
|
|
+ result->limitCount = ((SelectStmt *)parseTree)->limitCount;
|
|
+ }
|
|
else
|
|
result = transformCursorStmt(pstate, (SelectStmt *) parseTree);
|
|
break;
|
|
diff -cr src.orig/backend/parser/gram.y src/backend/parser/gram.y
|
|
*** src.orig/backend/parser/gram.y Fri Oct 16 11:53:42 1998
|
|
--- src/backend/parser/gram.y Sun Oct 18 22:20:36 1998
|
|
***************
|
|
*** 45,50 ****
|
|
--- 45,51 ----
|
|
#include "catalog/catname.h"
|
|
#include "utils/elog.h"
|
|
#include "access/xact.h"
|
|
+ #include "catalog/pg_type.h"
|
|
|
|
#ifdef MULTIBYTE
|
|
#include "mb/pg_wchar.h"
|
|
***************
|
|
*** 163,169 ****
|
|
sort_clause, sortby_list, index_params, index_list, name_list,
|
|
from_clause, from_list, opt_array_bounds, nest_array_bounds,
|
|
expr_list, attrs, res_target_list, res_target_list2,
|
|
! def_list, opt_indirection, group_clause, TriggerFuncArgs
|
|
|
|
%type <node> func_return
|
|
%type <boolean> set_opt
|
|
--- 164,171 ----
|
|
sort_clause, sortby_list, index_params, index_list, name_list,
|
|
from_clause, from_list, opt_array_bounds, nest_array_bounds,
|
|
expr_list, attrs, res_target_list, res_target_list2,
|
|
! def_list, opt_indirection, group_clause, TriggerFuncArgs,
|
|
! opt_select_limit
|
|
|
|
%type <node> func_return
|
|
%type <boolean> set_opt
|
|
***************
|
|
*** 192,197 ****
|
|
--- 194,201 ----
|
|
|
|
%type <ival> fetch_how_many
|
|
|
|
+ %type <node> select_limit_value select_offset_value
|
|
+
|
|
%type <list> OptSeqList
|
|
%type <defelt> OptSeqElem
|
|
|
|
***************
|
|
*** 267,273 ****
|
|
FALSE_P, FETCH, FLOAT, FOR, FOREIGN, FROM, FULL,
|
|
GRANT, GROUP, HAVING, HOUR_P,
|
|
IN, INNER_P, INSENSITIVE, INSERT, INTERVAL, INTO, IS,
|
|
! JOIN, KEY, LANGUAGE, LEADING, LEFT, LIKE, LOCAL,
|
|
MATCH, MINUTE_P, MONTH_P, NAMES,
|
|
NATIONAL, NATURAL, NCHAR, NEXT, NO, NOT, NULL_P, NUMERIC,
|
|
OF, ON, ONLY, OPTION, OR, ORDER, OUTER_P,
|
|
--- 271,277 ----
|
|
FALSE_P, FETCH, FLOAT, FOR, FOREIGN, FROM, FULL,
|
|
GRANT, GROUP, HAVING, HOUR_P,
|
|
IN, INNER_P, INSENSITIVE, INSERT, INTERVAL, INTO, IS,
|
|
! JOIN, KEY, LANGUAGE, LEADING, LEFT, LIKE, LIMIT, LOCAL,
|
|
MATCH, MINUTE_P, MONTH_P, NAMES,
|
|
NATIONAL, NATURAL, NCHAR, NEXT, NO, NOT, NULL_P, NUMERIC,
|
|
OF, ON, ONLY, OPTION, OR, ORDER, OUTER_P,
|
|
***************
|
|
*** 299,305 ****
|
|
INCREMENT, INDEX, INHERITS, INSTEAD, ISNULL,
|
|
LANCOMPILER, LISTEN, LOAD, LOCATION, LOCK_P, MAXVALUE, MINVALUE, MOVE,
|
|
NEW, NOCREATEDB, NOCREATEUSER, NONE, NOTHING, NOTIFY, NOTNULL,
|
|
! OIDS, OPERATOR, PASSWORD, PROCEDURAL,
|
|
RECIPE, RENAME, RESET, RETURNS, ROW, RULE,
|
|
SEQUENCE, SERIAL, SETOF, SHOW, START, STATEMENT, STDIN, STDOUT, TRUSTED,
|
|
UNLISTEN, UNTIL, VACUUM, VALID, VERBOSE, VERSION
|
|
--- 303,309 ----
|
|
INCREMENT, INDEX, INHERITS, INSTEAD, ISNULL,
|
|
LANCOMPILER, LISTEN, LOAD, LOCATION, LOCK_P, MAXVALUE, MINVALUE, MOVE,
|
|
NEW, NOCREATEDB, NOCREATEUSER, NONE, NOTHING, NOTIFY, NOTNULL,
|
|
! OFFSET, OIDS, OPERATOR, PASSWORD, PROCEDURAL,
|
|
RECIPE, RENAME, RESET, RETURNS, ROW, RULE,
|
|
SEQUENCE, SERIAL, SETOF, SHOW, START, STATEMENT, STDIN, STDOUT, TRUSTED,
|
|
UNLISTEN, UNTIL, VACUUM, VALID, VERBOSE, VERSION
|
|
***************
|
|
*** 2591,2596 ****
|
|
--- 2595,2601 ----
|
|
result from_clause where_clause
|
|
group_clause having_clause
|
|
union_clause sort_clause
|
|
+ opt_select_limit
|
|
{
|
|
SelectStmt *n = makeNode(SelectStmt);
|
|
n->unique = $2;
|
|
***************
|
|
*** 2602,2607 ****
|
|
--- 2607,2622 ----
|
|
n->havingClause = $8;
|
|
n->unionClause = $9;
|
|
n->sortClause = $10;
|
|
+ if ($11 != NIL)
|
|
+ {
|
|
+ n->limitOffset = nth(0, $11);
|
|
+ n->limitCount = nth(1, $11);
|
|
+ }
|
|
+ else
|
|
+ {
|
|
+ n->limitOffset = NULL;
|
|
+ n->limitCount = NULL;
|
|
+ }
|
|
$$ = (Node *)n;
|
|
}
|
|
;
|
|
***************
|
|
*** 2699,2704 ****
|
|
--- 2714,2794 ----
|
|
| ASC { $$ = "<"; }
|
|
| DESC { $$ = ">"; }
|
|
| /*EMPTY*/ { $$ = "<"; /*default*/ }
|
|
+ ;
|
|
+
|
|
+ opt_select_limit: LIMIT select_offset_value ',' select_limit_value
|
|
+ { $$ = lappend(lappend(NIL, $2), $4); }
|
|
+ | LIMIT select_limit_value OFFSET select_offset_value
|
|
+ { $$ = lappend(lappend(NIL, $4), $2); }
|
|
+ | LIMIT select_limit_value
|
|
+ { $$ = lappend(lappend(NIL, NULL), $2); }
|
|
+ | OFFSET select_offset_value LIMIT select_limit_value
|
|
+ { $$ = lappend(lappend(NIL, $2), $4); }
|
|
+ | OFFSET select_offset_value
|
|
+ { $$ = lappend(lappend(NIL, $2), NULL); }
|
|
+ | /* EMPTY */
|
|
+ { $$ = NIL; }
|
|
+ ;
|
|
+
|
|
+ select_limit_value: Iconst
|
|
+ {
|
|
+ Const *n = makeNode(Const);
|
|
+
|
|
+ if ($1 < 1)
|
|
+ elog(ERROR, "selection limit must be ALL or a positive integer value > 0");
|
|
+
|
|
+ n->consttype = INT4OID;
|
|
+ n->constlen = sizeof(int4);
|
|
+ n->constvalue = (Datum)$1;
|
|
+ n->constisnull = FALSE;
|
|
+ n->constbyval = TRUE;
|
|
+ n->constisset = FALSE;
|
|
+ n->constiscast = FALSE;
|
|
+ $$ = (Node *)n;
|
|
+ }
|
|
+ | ALL
|
|
+ {
|
|
+ Const *n = makeNode(Const);
|
|
+ n->consttype = INT4OID;
|
|
+ n->constlen = sizeof(int4);
|
|
+ n->constvalue = (Datum)0;
|
|
+ n->constisnull = FALSE;
|
|
+ n->constbyval = TRUE;
|
|
+ n->constisset = FALSE;
|
|
+ n->constiscast = FALSE;
|
|
+ $$ = (Node *)n;
|
|
+ }
|
|
+ | PARAM
|
|
+ {
|
|
+ Param *n = makeNode(Param);
|
|
+ n->paramkind = PARAM_NUM;
|
|
+ n->paramid = $1;
|
|
+ n->paramtype = INT4OID;
|
|
+ $$ = (Node *)n;
|
|
+ }
|
|
+ ;
|
|
+
|
|
+ select_offset_value: Iconst
|
|
+ {
|
|
+ Const *n = makeNode(Const);
|
|
+
|
|
+ n->consttype = INT4OID;
|
|
+ n->constlen = sizeof(int4);
|
|
+ n->constvalue = (Datum)$1;
|
|
+ n->constisnull = FALSE;
|
|
+ n->constbyval = TRUE;
|
|
+ n->constisset = FALSE;
|
|
+ n->constiscast = FALSE;
|
|
+ $$ = (Node *)n;
|
|
+ }
|
|
+ | PARAM
|
|
+ {
|
|
+ Param *n = makeNode(Param);
|
|
+ n->paramkind = PARAM_NUM;
|
|
+ n->paramid = $1;
|
|
+ n->paramtype = INT4OID;
|
|
+ $$ = (Node *)n;
|
|
+ }
|
|
;
|
|
|
|
/*
|
|
diff -cr src.orig/backend/parser/keywords.c src/backend/parser/keywords.c
|
|
*** src.orig/backend/parser/keywords.c Fri Oct 16 11:53:42 1998
|
|
--- src/backend/parser/keywords.c Sun Oct 18 22:13:29 1998
|
|
***************
|
|
*** 128,133 ****
|
|
--- 128,134 ----
|
|
{"leading", LEADING},
|
|
{"left", LEFT},
|
|
{"like", LIKE},
|
|
+ {"limit", LIMIT},
|
|
{"listen", LISTEN},
|
|
{"load", LOAD},
|
|
{"local", LOCAL},
|
|
***************
|
|
*** 156,161 ****
|
|
--- 157,163 ----
|
|
{"null", NULL_P},
|
|
{"numeric", NUMERIC},
|
|
{"of", OF},
|
|
+ {"offset", OFFSET},
|
|
{"oids", OIDS},
|
|
{"old", CURRENT},
|
|
{"on", ON},
|
|
diff -cr src.orig/backend/rewrite/rewriteDefine.c src/backend/rewrite/rewriteDefine.c
|
|
*** src.orig/backend/rewrite/rewriteDefine.c Fri Oct 16 11:53:46 1998
|
|
--- src/backend/rewrite/rewriteDefine.c Fri Oct 16 13:48:55 1998
|
|
***************
|
|
*** 312,317 ****
|
|
--- 312,323 ----
|
|
heap_close(event_relation);
|
|
|
|
/*
|
|
+ * LIMIT in view is not supported
|
|
+ */
|
|
+ if (query->limitOffset != NULL || query->limitCount != NULL)
|
|
+ elog(ERROR, "LIMIT clause not supported in views");
|
|
+
|
|
+ /*
|
|
* ... and finally the rule must be named _RETviewname.
|
|
*/
|
|
sprintf(expected_name, "_RET%s", event_obj->relname);
|
|
diff -cr src.orig/backend/tcop/pquery.c src/backend/tcop/pquery.c
|
|
*** src.orig/backend/tcop/pquery.c Fri Oct 16 11:53:47 1998
|
|
--- src/backend/tcop/pquery.c Fri Oct 16 14:02:36 1998
|
|
***************
|
|
*** 40,46 ****
|
|
#include "commands/command.h"
|
|
|
|
static char *CreateOperationTag(int operationType);
|
|
! static void ProcessQueryDesc(QueryDesc *queryDesc);
|
|
|
|
|
|
/* ----------------------------------------------------------------
|
|
--- 40,46 ----
|
|
#include "commands/command.h"
|
|
|
|
static char *CreateOperationTag(int operationType);
|
|
! static void ProcessQueryDesc(QueryDesc *queryDesc, Node *limoffset, Node *limcount);
|
|
|
|
|
|
/* ----------------------------------------------------------------
|
|
***************
|
|
*** 205,211 ****
|
|
* ----------------------------------------------------------------
|
|
*/
|
|
static void
|
|
! ProcessQueryDesc(QueryDesc *queryDesc)
|
|
{
|
|
Query *parseTree;
|
|
Plan *plan;
|
|
--- 205,211 ----
|
|
* ----------------------------------------------------------------
|
|
*/
|
|
static void
|
|
! ProcessQueryDesc(QueryDesc *queryDesc, Node *limoffset, Node *limcount)
|
|
{
|
|
Query *parseTree;
|
|
Plan *plan;
|
|
***************
|
|
*** 330,336 ****
|
|
* actually run the plan..
|
|
* ----------------
|
|
*/
|
|
! ExecutorRun(queryDesc, state, EXEC_RUN, 0);
|
|
|
|
/* save infos for EndCommand */
|
|
UpdateCommandInfo(operation, state->es_lastoid, state->es_processed);
|
|
--- 330,336 ----
|
|
* actually run the plan..
|
|
* ----------------
|
|
*/
|
|
! ExecutorRun(queryDesc, state, EXEC_RUN, limoffset, limcount);
|
|
|
|
/* save infos for EndCommand */
|
|
UpdateCommandInfo(operation, state->es_lastoid, state->es_processed);
|
|
***************
|
|
*** 373,377 ****
|
|
print_plan(plan, parsetree);
|
|
}
|
|
else
|
|
! ProcessQueryDesc(queryDesc);
|
|
}
|
|
--- 373,377 ----
|
|
print_plan(plan, parsetree);
|
|
}
|
|
else
|
|
! ProcessQueryDesc(queryDesc, parsetree->limitOffset, parsetree->limitCount);
|
|
}
|
|
diff -cr src.orig/include/executor/executor.h src/include/executor/executor.h
|
|
*** src.orig/include/executor/executor.h Fri Oct 16 11:53:56 1998
|
|
--- src/include/executor/executor.h Fri Oct 16 12:04:17 1998
|
|
***************
|
|
*** 83,89 ****
|
|
* prototypes from functions in execMain.c
|
|
*/
|
|
extern TupleDesc ExecutorStart(QueryDesc *queryDesc, EState *estate);
|
|
! extern TupleTableSlot *ExecutorRun(QueryDesc *queryDesc, EState *estate, int feature, int count);
|
|
extern void ExecutorEnd(QueryDesc *queryDesc, EState *estate);
|
|
extern HeapTuple ExecConstraints(char *caller, Relation rel, HeapTuple tuple);
|
|
#ifdef QUERY_LIMIT
|
|
--- 83,89 ----
|
|
* prototypes from functions in execMain.c
|
|
*/
|
|
extern TupleDesc ExecutorStart(QueryDesc *queryDesc, EState *estate);
|
|
! extern TupleTableSlot *ExecutorRun(QueryDesc *queryDesc, EState *estate, int feature, Node *limoffset, Node *limcount);
|
|
extern void ExecutorEnd(QueryDesc *queryDesc, EState *estate);
|
|
extern HeapTuple ExecConstraints(char *caller, Relation rel, HeapTuple tuple);
|
|
#ifdef QUERY_LIMIT
|
|
|
|
From owner-pgsql-hackers@hub.org Thu Oct 22 13:12:34 1998
|
|
Received: from renoir.op.net (root@renoir.op.net [209.152.193.4])
|
|
by candle.pha.pa.us (8.9.0/8.9.0) with ESMTP id NAA01350
|
|
for <maillist@candle.pha.pa.us>; Thu, 22 Oct 1998 13:12:29 -0400 (EDT)
|
|
Received: from hub.org (majordom@hub.org [209.47.148.200]) by renoir.op.net (o1/$ Revision: 1.18 $) with ESMTP id MAA17808 for <maillist@candle.pha.pa.us>; Thu, 22 Oct 1998 12:35:22 -0400 (EDT)
|
|
Received: from localhost (majordom@localhost)
|
|
by hub.org (8.9.1/8.8.8) with SMTP id KAA14887;
|
|
Thu, 22 Oct 1998 10:49:09 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@hub.org)
|
|
Received: by hub.org (TLB v0.10a (1.23 tibbs 1997/01/09 00:29:32)); Thu, 22 Oct 1998 10:44:59 +0000 (EDT)
|
|
Received: (from majordom@localhost)
|
|
by hub.org (8.9.1/8.8.8) id KAA14445
|
|
for pgsql-hackers-outgoing; Thu, 22 Oct 1998 10:44:57 -0400 (EDT)
|
|
(envelope-from owner-pgsql-hackers@postgreSQL.org)
|
|
X-Authentication-Warning: hub.org: majordom set sender to owner-pgsql-hackers@postgreSQL.org using -f
|
|
Received: from dsh.de (firewall-user@neptun.sns-felb.debis.de [53.122.101.2])
|
|
by hub.org (8.9.1/8.8.8) with ESMTP id KAA14431
|
|
for <pgsql-hackers@postgreSQL.org>; Thu, 22 Oct 1998 10:44:47 -0400 (EDT)
|
|
(envelope-from wieck@sapserv.debis.de)
|
|
Received: by dsh.de; id QAA19394; Thu, 22 Oct 1998 16:43:42 +0200 (MET DST)
|
|
Received: from dshmail.dsh.de(53.47.15.3) by neptun.dsh.de via smap (3.2)
|
|
id xma017268; Thu, 22 Oct 98 16:39:44 +0200
|
|
Received: from mail1.hh1.dsh.de (mail1.hh1.dsh.de [53.47.9.5])
|
|
by dshmail.dsh.de (8.8.7/8.8.7) with ESMTP id QAA02988;
|
|
Thu, 22 Oct 1998 16:36:46 +0200 (MET DST)
|
|
Received: from mars.SAPserv.Hamburg.dsh.de (root@mail5.hh1.dsh.de [53.2.168.17])
|
|
by mail1.hh1.dsh.de (8.8.7/8.8.7) with SMTP id QAA19155;
|
|
Thu, 22 Oct 1998 16:39:10 +0200
|
|
Received: from orion.SAPserv.Hamburg.dsh.de
|
|
by mars.SAPserv.Hamburg.dsh.de with smtp
|
|
for <<maillist@candle.pha.pa.us>>
|
|
id m0zWJWL-000B5DC; Thu, 22 Oct 98 14:07 MET DST
|
|
Received: by orion.SAPserv.Hamburg.dsh.de
|
|
for maillist@candle.pha.pa.us
|
|
id m0zWM4W-000EBPC; Thu, 22 Oct 98 16:50 MET DST
|
|
Message-Id: <m0zWM4W-000EBPC@orion.SAPserv.Hamburg.dsh.de>
|
|
From: jwieck@debis.com (Jan Wieck)
|
|
Subject: Re: [HACKERS] psql's help (the LIMIT stuff)
|
|
To: maillist@candle.pha.pa.us (Bruce Momjian)
|
|
Date: Thu, 22 Oct 1998 16:50:40 +0200 (MET DST)
|
|
Cc: jwieck@debis.com, jose@sferacarta.com, pgsql-hackers@postgreSQL.org
|
|
Reply-To: jwieck@debis.com (Jan Wieck)
|
|
In-Reply-To: <199810221424.KAA20601@candle.pha.pa.us> from "Bruce Momjian" at Oct 22, 98 10:24:08 am
|
|
X-Mailer: ELM [version 2.4 PL25]
|
|
Content-Type: text
|
|
Sender: owner-pgsql-hackers@postgreSQL.org
|
|
Precedence: bulk
|
|
Status: ROr
|
|
|
|
>
|
|
> > > >
|
|
> > > > I hope the QUERY_LIMIT too.
|
|
> > >
|
|
> > > I still have that cnfify() possible fix to review for KQSO. Are you
|
|
> > > still thinking limit for 6.4 final, or a minor release after that?
|
|
> >
|
|
> > I posted the part that is the minimum applied to 6.4 to make
|
|
> > adding LIMIT later non-initdb earlier. Anyway, here it's
|
|
> > again.
|
|
>
|
|
> Already applied. I assume it is the same as the one I applied.
|
|
|
|
Seen, thanks. Your 'Applied' just arrived after I packed it
|
|
again. It's the same.
|
|
|
|
> We are close to final, and can easily put it in 6.4.1, which I am sure
|
|
> we will need, and if we split CVS trees, you'll have lots of minor
|
|
> versions to pick from. :-)
|
|
>
|
|
> Seems like it would be a nice minor release item, but the problem is
|
|
> that minor releases aren't tested as much as major ones. How confident
|
|
> are you in the code? What do others thing?
|
|
|
|
I regression tested it, and did additional tests in the
|
|
SPI/PL area. It works. It only touches the parser and the
|
|
executor. Rules, planner/optimizer just bypass the values in
|
|
the parsetree. The parser and the executor are parts of
|
|
Postgres I feel very familiar with (not so in the optimizer).
|
|
I trust in the code and would use it in a production
|
|
environment.
|
|
|
|
It's below.
|
|
|
|
|
|
Jan
|
|
|
|
--
|
|
|
|
#======================================================================#
|
|
# It's easier to get forgiveness for being wrong than for being right. #
|
|
# Let's break this rule - forgive me. #
|
|
#======================================== jwieck@debis.com (Jan Wieck) #
|
|
|
|
|
|
diff -cr src.orig/backend/commands/command.c src/backend/commands/command.c
|
|
*** src.orig/backend/commands/command.c Fri Oct 16 11:53:38 1998
|
|
--- src/backend/commands/command.c Fri Oct 16 12:56:44 1998
|
|
***************
|
|
*** 39,44 ****
|
|
--- 39,45 ----
|
|
#include "utils/mcxt.h"
|
|
#include "utils/portal.h"
|
|
#include "utils/syscache.h"
|
|
+ #include "string.h"
|
|
|
|
/* ----------------
|
|
* PortalExecutorHeapMemory stuff
|
|
***************
|
|
*** 101,106 ****
|
|
--- 102,108 ----
|
|
int feature;
|
|
QueryDesc *queryDesc;
|
|
MemoryContext context;
|
|
+ Const limcount;
|
|
|
|
/* ----------------
|
|
* sanity checks
|
|
***************
|
|
*** 113,118 ****
|
|
--- 115,134 ----
|
|
}
|
|
|
|
/* ----------------
|
|
+ * Create a const node from the given count value
|
|
+ * ----------------
|
|
+ */
|
|
+ memset(&limcount, 0, sizeof(limcount));
|
|
+ limcount.type = T_Const;
|
|
+ limcount.consttype = INT4OID;
|
|
+ limcount.constlen = sizeof(int4);
|
|
+ limcount.constvalue = (Datum)count;
|
|
+ limcount.constisnull = FALSE;
|
|
+ limcount.constbyval = TRUE;
|
|
+ limcount.constisset = FALSE;
|
|
+ limcount.constiscast = FALSE;
|
|
+
|
|
+ /* ----------------
|
|
* get the portal from the portal name
|
|
* ----------------
|
|
*/
|
|
***************
|
|
*** 176,182 ****
|
|
PortalExecutorHeapMemory = (MemoryContext)
|
|
PortalGetHeapMemory(portal);
|
|
|
|
! ExecutorRun(queryDesc, PortalGetState(portal), feature, count);
|
|
|
|
if (dest == None) /* MOVE */
|
|
pfree(queryDesc);
|
|
--- 192,198 ----
|
|
PortalExecutorHeapMemory = (MemoryContext)
|
|
PortalGetHeapMemory(portal);
|
|
|
|
! ExecutorRun(queryDesc, PortalGetState(portal), feature, (Node *)NULL, (Node *)&limcount);
|
|
|
|
if (dest == None) /* MOVE */
|
|
pfree(queryDesc);
|
|
diff -cr src.orig/backend/executor/execMain.c src/backend/executor/execMain.c
|
|
*** src.orig/backend/executor/execMain.c Fri Oct 16 11:53:38 1998
|
|
--- src/backend/executor/execMain.c Fri Oct 16 20:05:19 1998
|
|
***************
|
|
*** 64,69 ****
|
|
--- 64,70 ----
|
|
static void EndPlan(Plan *plan, EState *estate);
|
|
static TupleTableSlot *ExecutePlan(EState *estate, Plan *plan,
|
|
Query *parseTree, CmdType operation,
|
|
+ int offsetTuples,
|
|
int numberTuples, ScanDirection direction,
|
|
void (*printfunc) ());
|
|
static void ExecRetrieve(TupleTableSlot *slot, void (*printfunc) (),
|
|
***************
|
|
*** 163,169 ****
|
|
* ----------------------------------------------------------------
|
|
*/
|
|
TupleTableSlot *
|
|
! ExecutorRun(QueryDesc *queryDesc, EState *estate, int feature, int count)
|
|
{
|
|
CmdType operation;
|
|
Query *parseTree;
|
|
--- 164,170 ----
|
|
* ----------------------------------------------------------------
|
|
*/
|
|
TupleTableSlot *
|
|
! ExecutorRun(QueryDesc *queryDesc, EState *estate, int feature, Node *limoffset, Node *limcount)
|
|
{
|
|
CmdType operation;
|
|
Query *parseTree;
|
|
***************
|
|
*** 171,176 ****
|
|
--- 172,179 ----
|
|
TupleTableSlot *result;
|
|
CommandDest dest;
|
|
void (*destination) ();
|
|
+ int offset = 0;
|
|
+ int count = 0;
|
|
|
|
/******************
|
|
* sanity checks
|
|
***************
|
|
*** 191,196 ****
|
|
--- 194,289 ----
|
|
estate->es_processed = 0;
|
|
estate->es_lastoid = InvalidOid;
|
|
|
|
+ /******************
|
|
+ * if given get the offset of the LIMIT clause
|
|
+ ******************
|
|
+ */
|
|
+ if (limoffset != NULL)
|
|
+ {
|
|
+ Const *coffset;
|
|
+ Param *poffset;
|
|
+ ParamListInfo paramLI;
|
|
+ int i;
|
|
+
|
|
+ switch (nodeTag(limoffset))
|
|
+ {
|
|
+ case T_Const:
|
|
+ coffset = (Const *)limoffset;
|
|
+ offset = (int)(coffset->constvalue);
|
|
+ break;
|
|
+
|
|
+ case T_Param:
|
|
+ poffset = (Param *)limoffset;
|
|
+ paramLI = estate->es_param_list_info;
|
|
+
|
|
+ if (paramLI == NULL)
|
|
+ elog(ERROR, "parameter for limit offset not in executor state");
|
|
+ for (i = 0; paramLI[i].kind != PARAM_INVALID; i++)
|
|
+ {
|
|
+ if (paramLI[i].kind == PARAM_NUM && paramLI[i].id == poffset->paramid)
|
|
+ break;
|
|
+ }
|
|
+ if (paramLI[i].kind == PARAM_INVALID)
|
|
+ elog(ERROR, "parameter for limit offset not in executor state");
|
|
+ if (paramLI[i].isnull)
|
|
+ elog(ERROR, "limit offset cannot be NULL value");
|
|
+ offset = (int)(paramLI[i].value);
|
|
+
|
|
+ break;
|
|
+
|
|
+ default:
|
|
+ elog(ERROR, "unexpected node type %d as limit offset", nodeTag(limoffset));
|
|
+ }
|
|
+
|
|
+ if (offset < 0)
|
|
+ elog(ERROR, "limit offset cannot be negative");
|
|
+ }
|
|
+
|
|
+ /******************
|
|
+ * if given get the count of the LIMIT clause
|
|
+ ******************
|
|
+ */
|
|
+ if (limcount != NULL)
|
|
+ {
|
|
+ Const *ccount;
|
|
+ Param *pcount;
|
|
+ ParamListInfo paramLI;
|
|
+ int i;
|
|
+
|
|
+ switch (nodeTag(limcount))
|
|
+ {
|
|
+ case T_Const:
|
|
+ ccount = (Const *)limcount;
|
|
+ count = (int)(ccount->constvalue);
|
|
+ break;
|
|
+
|
|
+ case T_Param:
|
|
+ pcount = (Param *)limcount;
|
|
+ paramLI = estate->es_param_list_info;
|
|
+
|
|
+ if (paramLI == NULL)
|
|
+ elog(ERROR, "parameter for limit count not in executor state");
|
|
+ for (i = 0; paramLI[i].kind != PARAM_INVALID; i++)
|
|
+ {
|
|
+ if (paramLI[i].kind == PARAM_NUM && paramLI[i].id == pcount->paramid)
|
|
+ break;
|
|
+ }
|
|
+ if (paramLI[i].kind == PARAM_INVALID)
|
|
+ elog(ERROR, "parameter for limit count not in executor state");
|
|
+ if (paramLI[i].isnull)
|
|
+ elog(ERROR, "limit count cannot be NULL value");
|
|
+ count = (int)(paramLI[i].value);
|
|
+
|
|
+ break;
|
|
+
|
|
+ default:
|
|
+ elog(ERROR, "unexpected node type %d as limit count", nodeTag(limcount));
|
|
+ }
|
|
+
|
|
+ if (count < 0)
|
|
+ elog(ERROR, "limit count cannot be negative");
|
|
+ }
|
|
+
|
|
switch (feature)
|
|
{
|
|
|
|
***************
|
|
*** 199,205 ****
|
|
plan,
|
|
parseTree,
|
|
operation,
|
|
! ALL_TUPLES,
|
|
ForwardScanDirection,
|
|
destination);
|
|
break;
|
|
--- 292,299 ----
|
|
plan,
|
|
parseTree,
|
|
operation,
|
|
! offset,
|
|
! count,
|
|
ForwardScanDirection,
|
|
destination);
|
|
break;
|
|
***************
|
|
*** 208,213 ****
|
|
--- 302,308 ----
|
|
plan,
|
|
parseTree,
|
|
operation,
|
|
+ offset,
|
|
count,
|
|
ForwardScanDirection,
|
|
destination);
|
|
***************
|
|
*** 222,227 ****
|
|
--- 317,323 ----
|
|
plan,
|
|
parseTree,
|
|
operation,
|
|
+ offset,
|
|
count,
|
|
BackwardScanDirection,
|
|
destination);
|
|
***************
|
|
*** 237,242 ****
|
|
--- 333,339 ----
|
|
plan,
|
|
parseTree,
|
|
operation,
|
|
+ 0,
|
|
ONE_TUPLE,
|
|
ForwardScanDirection,
|
|
destination);
|
|
***************
|
|
*** 691,696 ****
|
|
--- 788,794 ----
|
|
Plan *plan,
|
|
Query *parseTree,
|
|
CmdType operation,
|
|
+ int offsetTuples,
|
|
int numberTuples,
|
|
ScanDirection direction,
|
|
void (*printfunc) ())
|
|
***************
|
|
*** 742,747 ****
|
|
--- 840,859 ----
|
|
{
|
|
result = NULL;
|
|
break;
|
|
+ }
|
|
+
|
|
+ /******************
|
|
+ * For now we completely execute the plan and skip
|
|
+ * result tuples if requested by LIMIT offset.
|
|
+ * Finally we should try to do it in deeper levels
|
|
+ * if possible (during index scan)
|
|
+ * - Jan
|
|
+ ******************
|
|
+ */
|
|
+ if (offsetTuples > 0)
|
|
+ {
|
|
+ --offsetTuples;
|
|
+ continue;
|
|
}
|
|
|
|
/******************
|
|
diff -cr src.orig/backend/executor/functions.c src/backend/executor/functions.c
|
|
*** src.orig/backend/executor/functions.c Fri Oct 16 11:53:38 1998
|
|
--- src/backend/executor/functions.c Fri Oct 16 19:01:02 1998
|
|
***************
|
|
*** 130,135 ****
|
|
--- 130,138 ----
|
|
None);
|
|
estate = CreateExecutorState();
|
|
|
|
+ if (queryTree->limitOffset != NULL || queryTree->limitCount != NULL)
|
|
+ elog(ERROR, "LIMIT clause from SQL functions not yet implemented");
|
|
+
|
|
if (nargs > 0)
|
|
{
|
|
int i;
|
|
***************
|
|
*** 200,206 ****
|
|
|
|
feature = (LAST_POSTQUEL_COMMAND(es)) ? EXEC_RETONE : EXEC_RUN;
|
|
|
|
! return ExecutorRun(es->qd, es->estate, feature, 0);
|
|
}
|
|
|
|
static void
|
|
--- 203,209 ----
|
|
|
|
feature = (LAST_POSTQUEL_COMMAND(es)) ? EXEC_RETONE : EXEC_RUN;
|
|
|
|
! return ExecutorRun(es->qd, es->estate, feature, (Node *)NULL, (Node *)NULL);
|
|
}
|
|
|
|
static void
|
|
diff -cr src.orig/backend/executor/spi.c src/backend/executor/spi.c
|
|
*** src.orig/backend/executor/spi.c Fri Oct 16 11:53:39 1998
|
|
--- src/backend/executor/spi.c Fri Oct 16 19:25:33 1998
|
|
***************
|
|
*** 791,796 ****
|
|
--- 791,798 ----
|
|
bool isRetrieveIntoRelation = false;
|
|
char *intoName = NULL;
|
|
int res;
|
|
+ Const tcount_const;
|
|
+ Node *count = NULL;
|
|
|
|
switch (operation)
|
|
{
|
|
***************
|
|
*** 825,830 ****
|
|
--- 827,865 ----
|
|
return SPI_ERROR_OPUNKNOWN;
|
|
}
|
|
|
|
+ /* ----------------
|
|
+ * Get the query LIMIT tuple count
|
|
+ * ----------------
|
|
+ */
|
|
+ if (parseTree->limitCount != NULL)
|
|
+ {
|
|
+ /* ----------------
|
|
+ * A limit clause in the parsetree overrides the
|
|
+ * tcount parameter
|
|
+ * ----------------
|
|
+ */
|
|
+ count = parseTree->limitCount;
|
|
+ }
|
|
+ else
|
|
+ {
|
|
+ /* ----------------
|
|
+ * No LIMIT clause in parsetree. Use a local Const node
|
|
+ * to put tcount into it
|
|
+ * ----------------
|
|
+ */
|
|
+ memset(&tcount_const, 0, sizeof(tcount_const));
|
|
+ tcount_const.type = T_Const;
|
|
+ tcount_const.consttype = INT4OID;
|
|
+ tcount_const.constlen = sizeof(int4);
|
|
+ tcount_const.constvalue = (Datum)tcount;
|
|
+ tcount_const.constisnull = FALSE;
|
|
+ tcount_const.constbyval = TRUE;
|
|
+ tcount_const.constisset = FALSE;
|
|
+ tcount_const.constiscast = FALSE;
|
|
+
|
|
+ count = (Node *)&tcount_const;
|
|
+ }
|
|
+
|
|
if (state == NULL) /* plan preparation */
|
|
return res;
|
|
#ifdef SPI_EXECUTOR_STATS
|
|
***************
|
|
*** 845,851 ****
|
|
return SPI_OK_CURSOR;
|
|
}
|
|
|
|
! ExecutorRun(queryDesc, state, EXEC_FOR, tcount);
|
|
|
|
_SPI_current->processed = state->es_processed;
|
|
if (operation == CMD_SELECT && queryDesc->dest == SPI)
|
|
--- 880,886 ----
|
|
return SPI_OK_CURSOR;
|
|
}
|
|
|
|
! ExecutorRun(queryDesc, state, EXEC_FOR, parseTree->limitOffset, count);
|
|
|
|
_SPI_current->processed = state->es_processed;
|
|
if (operation == CMD_SELECT && queryDesc->dest == SPI)
|
|
diff -cr src.orig/backend/parser/analyze.c src/backend/parser/analyze.c
|
|
*** src.orig/backend/parser/analyze.c Fri Oct 16 11:53:41 1998
|
|
--- src/backend/parser/analyze.c Fri Oct 16 13:29:27 1998
|
|
***************
|
|
*** 180,186 ****
|
|
--- 180,190 ----
|
|
|
|
case T_SelectStmt:
|
|
if (!((SelectStmt *) parseTree)->portalname)
|
|
+ {
|
|
result = transformSelectStmt(pstate, (SelectStmt *) parseTree);
|
|
+ result->limitOffset = ((SelectStmt *)parseTree)->limitOffset;
|
|
+ result->limitCount = ((SelectStmt *)parseTree)->limitCount;
|
|
+ }
|
|
else
|
|
result = transformCursorStmt(pstate, (SelectStmt *) parseTree);
|
|
break;
|
|
diff -cr src.orig/backend/parser/gram.y src/backend/parser/gram.y
|
|
*** src.orig/backend/parser/gram.y Fri Oct 16 11:53:42 1998
|
|
--- src/backend/parser/gram.y Sun Oct 18 22:20:36 1998
|
|
***************
|
|
*** 45,50 ****
|
|
--- 45,51 ----
|
|
#include "catalog/catname.h"
|
|
#include "utils/elog.h"
|
|
#include "access/xact.h"
|
|
+ #include "catalog/pg_type.h"
|
|
|
|
#ifdef MULTIBYTE
|
|
#include "mb/pg_wchar.h"
|
|
***************
|
|
*** 163,169 ****
|
|
sort_clause, sortby_list, index_params, index_list, name_list,
|
|
from_clause, from_list, opt_array_bounds, nest_array_bounds,
|
|
expr_list, attrs, res_target_list, res_target_list2,
|
|
! def_list, opt_indirection, group_clause, TriggerFuncArgs
|
|
|
|
%type <node> func_return
|
|
%type <boolean> set_opt
|
|
--- 164,171 ----
|
|
sort_clause, sortby_list, index_params, index_list, name_list,
|
|
from_clause, from_list, opt_array_bounds, nest_array_bounds,
|
|
expr_list, attrs, res_target_list, res_target_list2,
|
|
! def_list, opt_indirection, group_clause, TriggerFuncArgs,
|
|
! opt_select_limit
|
|
|
|
%type <node> func_return
|
|
%type <boolean> set_opt
|
|
***************
|
|
*** 192,197 ****
|
|
--- 194,201 ----
|
|
|
|
%type <ival> fetch_how_many
|
|
|
|
+ %type <node> select_limit_value select_offset_value
|
|
+
|
|
%type <list> OptSeqList
|
|
%type <defelt> OptSeqElem
|
|
|
|
***************
|
|
*** 267,273 ****
|
|
FALSE_P, FETCH, FLOAT, FOR, FOREIGN, FROM, FULL,
|
|
GRANT, GROUP, HAVING, HOUR_P,
|
|
IN, INNER_P, INSENSITIVE, INSERT, INTERVAL, INTO, IS,
|
|
! JOIN, KEY, LANGUAGE, LEADING, LEFT, LIKE, LOCAL,
|
|
MATCH, MINUTE_P, MONTH_P, NAMES,
|
|
NATIONAL, NATURAL, NCHAR, NEXT, NO, NOT, NULL_P, NUMERIC,
|
|
OF, ON, ONLY, OPTION, OR, ORDER, OUTER_P,
|
|
--- 271,277 ----
|
|
FALSE_P, FETCH, FLOAT, FOR, FOREIGN, FROM, FULL,
|
|
GRANT, GROUP, HAVING, HOUR_P,
|
|
IN, INNER_P, INSENSITIVE, INSERT, INTERVAL, INTO, IS,
|
|
! JOIN, KEY, LANGUAGE, LEADING, LEFT, LIKE, LIMIT, LOCAL,
|
|
MATCH, MINUTE_P, MONTH_P, NAMES,
|
|
NATIONAL, NATURAL, NCHAR, NEXT, NO, NOT, NULL_P, NUMERIC,
|
|
OF, ON, ONLY, OPTION, OR, ORDER, OUTER_P,
|
|
***************
|
|
*** 299,305 ****
|
|
INCREMENT, INDEX, INHERITS, INSTEAD, ISNULL,
|
|
LANCOMPILER, LISTEN, LOAD, LOCATION, LOCK_P, MAXVALUE, MINVALUE, MOVE,
|
|
NEW, NOCREATEDB, NOCREATEUSER, NONE, NOTHING, NOTIFY, NOTNULL,
|
|
! OIDS, OPERATOR, PASSWORD, PROCEDURAL,
|
|
RECIPE, RENAME, RESET, RETURNS, ROW, RULE,
|
|
SEQUENCE, SERIAL, SETOF, SHOW, START, STATEMENT, STDIN, STDOUT, TRUSTED,
|
|
UNLISTEN, UNTIL, VACUUM, VALID, VERBOSE, VERSION
|
|
--- 303,309 ----
|
|
INCREMENT, INDEX, INHERITS, INSTEAD, ISNULL,
|
|
LANCOMPILER, LISTEN, LOAD, LOCATION, LOCK_P, MAXVALUE, MINVALUE, MOVE,
|
|
NEW, NOCREATEDB, NOCREATEUSER, NONE, NOTHING, NOTIFY, NOTNULL,
|
|
! OFFSET, OIDS, OPERATOR, PASSWORD, PROCEDURAL,
|
|
RECIPE, RENAME, RESET, RETURNS, ROW, RULE,
|
|
SEQUENCE, SERIAL, SETOF, SHOW, START, STATEMENT, STDIN, STDOUT, TRUSTED,
|
|
UNLISTEN, UNTIL, VACUUM, VALID, VERBOSE, VERSION
|
|
***************
|
|
*** 2591,2596 ****
|
|
--- 2595,2601 ----
|
|
result from_clause where_clause
|
|
group_clause having_clause
|
|
union_clause sort_clause
|
|
+ opt_select_limit
|
|
{
|
|
SelectStmt *n = makeNode(SelectStmt);
|
|
n->unique = $2;
|
|
***************
|
|
*** 2602,2607 ****
|
|
--- 2607,2622 ----
|
|
n->havingClause = $8;
|
|
n->unionClause = $9;
|
|
n->sortClause = $10;
|
|
+ if ($11 != NIL)
|
|
+ {
|
|
+ n->limitOffset = nth(0, $11);
|
|
+ n->limitCount = nth(1, $11);
|
|
+ }
|
|
+ else
|
|
+ {
|
|
+ n->limitOffset = NULL;
|
|
+ n->limitCount = NULL;
|
|
+ }
|
|
$$ = (Node *)n;
|
|
}
|
|
;
|
|
***************
|
|
*** 2699,2704 ****
|
|
--- 2714,2794 ----
|
|
| ASC { $$ = "<"; }
|
|
| DESC { $$ = ">"; }
|
|
| /*EMPTY*/ { $$ = "<"; /*default*/ }
|
|
+ ;
|
|
+
|
|
+ opt_select_limit: LIMIT select_offset_value ',' select_limit_value
|
|
+ { $$ = lappend(lappend(NIL, $2), $4); }
|
|
+ | LIMIT select_limit_value OFFSET select_offset_value
|
|
+ { $$ = lappend(lappend(NIL, $4), $2); }
|
|
+ | LIMIT select_limit_value
|
|
+ { $$ = lappend(lappend(NIL, NULL), $2); }
|
|
+ | OFFSET select_offset_value LIMIT select_limit_value
|
|
+ { $$ = lappend(lappend(NIL, $2), $4); }
|
|
+ | OFFSET select_offset_value
|
|
+ { $$ = lappend(lappend(NIL, $2), NULL); }
|
|
+ | /* EMPTY */
|
|
+ { $$ = NIL; }
|
|
+ ;
|
|
+
|
|
+ select_limit_value: Iconst
|
|
+ {
|
|
+ Const *n = makeNode(Const);
|
|
+
|
|
+ if ($1 < 1)
|
|
+ elog(ERROR, "selection limit must be ALL or a positive integer value > 0");
|
|
+
|
|
+ n->consttype = INT4OID;
|
|
+ n->constlen = sizeof(int4);
|
|
+ n->constvalue = (Datum)$1;
|
|
+ n->constisnull = FALSE;
|
|
+ n->constbyval = TRUE;
|
|
+ n->constisset = FALSE;
|
|
+ n->constiscast = FALSE;
|
|
+ $$ = (Node *)n;
|
|
+ }
|
|
+ | ALL
|
|
+ {
|
|
+ Const *n = makeNode(Const);
|
|
+ n->consttype = INT4OID;
|
|
+ n->constlen = sizeof(int4);
|
|
+ n->constvalue = (Datum)0;
|
|
+ n->constisnull = FALSE;
|
|
+ n->constbyval = TRUE;
|
|
+ n->constisset = FALSE;
|
|
+ n->constiscast = FALSE;
|
|
+ $$ = (Node *)n;
|
|
+ }
|
|
+ | PARAM
|
|
+ {
|
|
+ Param *n = makeNode(Param);
|
|
+ n->paramkind = PARAM_NUM;
|
|
+ n->paramid = $1;
|
|
+ n->paramtype = INT4OID;
|
|
+ $$ = (Node *)n;
|
|
+ }
|
|
+ ;
|
|
+
|
|
+ select_offset_value: Iconst
|
|
+ {
|
|
+ Const *n = makeNode(Const);
|
|
+
|
|
+ n->consttype = INT4OID;
|
|
+ n->constlen = sizeof(int4);
|
|
+ n->constvalue = (Datum)$1;
|
|
+ n->constisnull = FALSE;
|
|
+ n->constbyval = TRUE;
|
|
+ n->constisset = FALSE;
|
|
+ n->constiscast = FALSE;
|
|
+ $$ = (Node *)n;
|
|
+ }
|
|
+ | PARAM
|
|
+ {
|
|
+ Param *n = makeNode(Param);
|
|
+ n->paramkind = PARAM_NUM;
|
|
+ n->paramid = $1;
|
|
+ n->paramtype = INT4OID;
|
|
+ $$ = (Node *)n;
|
|
+ }
|
|
;
|
|
|
|
/*
|
|
diff -cr src.orig/backend/parser/keywords.c src/backend/parser/keywords.c
|
|
*** src.orig/backend/parser/keywords.c Fri Oct 16 11:53:42 1998
|
|
--- src/backend/parser/keywords.c Sun Oct 18 22:13:29 1998
|
|
***************
|
|
*** 128,133 ****
|
|
--- 128,134 ----
|
|
{"leading", LEADING},
|
|
{"left", LEFT},
|
|
{"like", LIKE},
|
|
+ {"limit", LIMIT},
|
|
{"listen", LISTEN},
|
|
{"load", LOAD},
|
|
{"local", LOCAL},
|
|
***************
|
|
*** 156,161 ****
|
|
--- 157,163 ----
|
|
{"null", NULL_P},
|
|
{"numeric", NUMERIC},
|
|
{"of", OF},
|
|
+ {"offset", OFFSET},
|
|
{"oids", OIDS},
|
|
{"old", CURRENT},
|
|
{"on", ON},
|
|
diff -cr src.orig/backend/rewrite/rewriteDefine.c src/backend/rewrite/rewriteDefine.c
|
|
*** src.orig/backend/rewrite/rewriteDefine.c Fri Oct 16 11:53:46 1998
|
|
--- src/backend/rewrite/rewriteDefine.c Fri Oct 16 13:48:55 1998
|
|
***************
|
|
*** 312,317 ****
|
|
--- 312,323 ----
|
|
heap_close(event_relation);
|
|
|
|
/*
|
|
+ * LIMIT in view is not supported
|
|
+ */
|
|
+ if (query->limitOffset != NULL || query->limitCount != NULL)
|
|
+ elog(ERROR, "LIMIT clause not supported in views");
|
|
+
|
|
+ /*
|
|
* ... and finally the rule must be named _RETviewname.
|
|
*/
|
|
sprintf(expected_name, "_RET%s", event_obj->relname);
|
|
diff -cr src.orig/backend/tcop/pquery.c src/backend/tcop/pquery.c
|
|
*** src.orig/backend/tcop/pquery.c Fri Oct 16 11:53:47 1998
|
|
--- src/backend/tcop/pquery.c Fri Oct 16 14:02:36 1998
|
|
***************
|
|
*** 40,46 ****
|
|
#include "commands/command.h"
|
|
|
|
static char *CreateOperationTag(int operationType);
|
|
! static void ProcessQueryDesc(QueryDesc *queryDesc);
|
|
|
|
|
|
/* ----------------------------------------------------------------
|
|
--- 40,46 ----
|
|
#include "commands/command.h"
|
|
|
|
static char *CreateOperationTag(int operationType);
|
|
! static void ProcessQueryDesc(QueryDesc *queryDesc, Node *limoffset, Node *limcount);
|
|
|
|
|
|
/* ----------------------------------------------------------------
|
|
***************
|
|
*** 205,211 ****
|
|
* ----------------------------------------------------------------
|
|
*/
|
|
static void
|
|
! ProcessQueryDesc(QueryDesc *queryDesc)
|
|
{
|
|
Query *parseTree;
|
|
Plan *plan;
|
|
--- 205,211 ----
|
|
* ----------------------------------------------------------------
|
|
*/
|
|
static void
|
|
! ProcessQueryDesc(QueryDesc *queryDesc, Node *limoffset, Node *limcount)
|
|
{
|
|
Query *parseTree;
|
|
Plan *plan;
|
|
***************
|
|
*** 330,336 ****
|
|
* actually run the plan..
|
|
* ----------------
|
|
*/
|
|
! ExecutorRun(queryDesc, state, EXEC_RUN, 0);
|
|
|
|
/* save infos for EndCommand */
|
|
UpdateCommandInfo(operation, state->es_lastoid, state->es_processed);
|
|
--- 330,336 ----
|
|
* actually run the plan..
|
|
* ----------------
|
|
*/
|
|
! ExecutorRun(queryDesc, state, EXEC_RUN, limoffset, limcount);
|
|
|
|
/* save infos for EndCommand */
|
|
UpdateCommandInfo(operation, state->es_lastoid, state->es_processed);
|
|
***************
|
|
*** 373,377 ****
|
|
print_plan(plan, parsetree);
|
|
}
|
|
else
|
|
! ProcessQueryDesc(queryDesc);
|
|
}
|
|
--- 373,377 ----
|
|
print_plan(plan, parsetree);
|
|
}
|
|
else
|
|
! ProcessQueryDesc(queryDesc, parsetree->limitOffset, parsetree->limitCount);
|
|
}
|
|
diff -cr src.orig/include/executor/executor.h src/include/executor/executor.h
|
|
*** src.orig/include/executor/executor.h Fri Oct 16 11:53:56 1998
|
|
--- src/include/executor/executor.h Fri Oct 16 12:04:17 1998
|
|
***************
|
|
*** 83,89 ****
|
|
* prototypes from functions in execMain.c
|
|
*/
|
|
extern TupleDesc ExecutorStart(QueryDesc *queryDesc, EState *estate);
|
|
! extern TupleTableSlot *ExecutorRun(QueryDesc *queryDesc, EState *estate, int feature, int count);
|
|
extern void ExecutorEnd(QueryDesc *queryDesc, EState *estate);
|
|
extern HeapTuple ExecConstraints(char *caller, Relation rel, HeapTuple tuple);
|
|
#ifdef QUERY_LIMIT
|
|
--- 83,89 ----
|
|
* prototypes from functions in execMain.c
|
|
*/
|
|
extern TupleDesc ExecutorStart(QueryDesc *queryDesc, EState *estate);
|
|
! extern TupleTableSlot *ExecutorRun(QueryDesc *queryDesc, EState *estate, int feature, Node *limoffset, Node *limcount);
|
|
extern void ExecutorEnd(QueryDesc *queryDesc, EState *estate);
|
|
extern HeapTuple ExecConstraints(char *caller, Relation rel, HeapTuple tuple);
|
|
#ifdef QUERY_LIMIT
|
|
|
|
|