mirror of
https://git.postgresql.org/git/postgresql.git
synced 2025-01-06 15:24:56 +08:00
Remove FAQ and FAQ_DEV ASCII and HTML files from CVS; now on the wiki.
Per-language files kept for transator usage.
This commit is contained in:
parent
2762e5555c
commit
bf4497cc61
873
doc/FAQ
873
doc/FAQ
@ -1,873 +0,0 @@
|
||||
|
||||
Frequently Asked Questions (FAQ) for PostgreSQL
|
||||
|
||||
Last updated: Tue Sep 23 16:19:49 EDT 2008
|
||||
|
||||
Current maintainer: Bruce Momjian (bruce@momjian.us)
|
||||
|
||||
The most recent version of this document can be viewed at
|
||||
http://www.postgresql.org/files/documentation/faqs/FAQ.html.
|
||||
|
||||
Platform-specific questions are answered at
|
||||
http://www.postgresql.org/docs/faq/.
|
||||
_________________________________________________________________
|
||||
|
||||
General Questions
|
||||
|
||||
1.1) What is PostgreSQL? How is it pronounced? What is Postgres?
|
||||
1.2) Who controls PostgreSQL?
|
||||
1.3) What is the copyright of PostgreSQL?
|
||||
1.4) What platforms does PostgreSQL support?
|
||||
1.5) Where can I get PostgreSQL?
|
||||
1.6) What is the most recent release?
|
||||
1.7) Where can I get support?
|
||||
1.8) How do I submit a bug report?
|
||||
1.9) How do I find out about known bugs or missing features?
|
||||
1.10) What documentation is available?
|
||||
1.11) How can I learn SQL?
|
||||
1.12) How do I submit a patch or join the development team?
|
||||
1.13) How does PostgreSQL compare to other DBMSs? Can PostgreSQL be
|
||||
embedded?
|
||||
1.14) Will PostgreSQL handle recent daylight saving time changes in
|
||||
various countries?
|
||||
1.15) How do I unsubscribe from the PostgreSQL email lists? How do I
|
||||
avoid receiving duplicate emails?
|
||||
|
||||
User Client Questions
|
||||
|
||||
2.1) What interfaces are available for PostgreSQL?
|
||||
2.2) What tools are available for using PostgreSQL with Web pages?
|
||||
2.3) Does PostgreSQL have a graphical user interface?
|
||||
|
||||
Administrative Questions
|
||||
|
||||
3.1) How do I install PostgreSQL somewhere other than
|
||||
/usr/local/pgsql?
|
||||
3.2) How do I control connections from other hosts?
|
||||
3.3) How do I tune the database engine for better performance?
|
||||
3.4) What debugging features are available?
|
||||
3.5) Why do I get "Sorry, too many clients" when trying to connect?
|
||||
3.6 What is the upgrade process for PostgreSQL?
|
||||
3.7) What computer hardware should I use?
|
||||
|
||||
Operational Questions
|
||||
|
||||
4.1) How do I SELECT only the first few rows of a query? A random row?
|
||||
4.2) How do I find out what tables, indexes, databases, and users are
|
||||
defined? How do I see the queries used by psql to display them?
|
||||
4.3) How do you change a column's data type?
|
||||
4.4) What is the maximum size for a row, a table, and a database?
|
||||
4.5) How much database disk space is required to store data from a
|
||||
typical text file?
|
||||
4.6) Why are my queries slow? Why don't they use my indexes?
|
||||
4.7) How do I see how the query optimizer is evaluating my query?
|
||||
4.8) How do I perform regular expression searches and case-insensitive
|
||||
regular expression searches? How do I use an index for
|
||||
case-insensitive searches?
|
||||
4.9) In a query, how do I detect if a field is NULL? How do I
|
||||
concatenate possible NULLs? How can I sort on whether a field is NULL
|
||||
or not?
|
||||
4.10) What is the difference between the various character types?
|
||||
4.11.1) How do I create a serial/auto-incrementing field?
|
||||
4.11.2) How do I get the value of a SERIAL insert?
|
||||
4.11.3) Doesn't currval() lead to a race condition with other users?
|
||||
4.11.4) Why aren't my sequence numbers reused on transaction abort?
|
||||
Why are there gaps in the numbering of my sequence/SERIAL column?
|
||||
4.12) What is an OID? What is a CTID?
|
||||
4.13) Why do I get the error "ERROR: Memory exhausted in
|
||||
AllocSetAlloc()"?
|
||||
4.14) How do I tell what PostgreSQL version I am running?
|
||||
4.15) How do I create a column that will default to the current time?
|
||||
4.16) How do I perform an outer join?
|
||||
4.17) How do I perform queries using multiple databases?
|
||||
4.18) How do I return multiple rows or columns from a function?
|
||||
4.19) Why do I get "relation with OID ##### does not exist" errors
|
||||
when accessing temporary tables in PL/PgSQL functions?
|
||||
4.20) What replication solutions are available?
|
||||
4.21) Why are my table and column names not recognized in my query?
|
||||
Why is capitalization not preserved?
|
||||
_________________________________________________________________
|
||||
|
||||
General Questions
|
||||
|
||||
1.1) What is PostgreSQL? How is it pronounced? What is Postgres?
|
||||
|
||||
PostgreSQL is pronounced Post-Gres-Q-L. (For those curious about how
|
||||
to say "PostgreSQL", an audio file is available.)
|
||||
|
||||
PostgreSQL is an object-relational database system that has the
|
||||
features of traditional commercial database systems with enhancements
|
||||
to be found in next-generation DBMS systems. PostgreSQL is free and
|
||||
the complete source code is available.
|
||||
|
||||
PostgreSQL development is performed by a team of mostly volunteer
|
||||
developers spread throughout the world and communicating via the
|
||||
Internet. It is a community project and is not controlled by any
|
||||
company. To get involved, see the developer's FAQ at
|
||||
http://www.postgresql.org/docs/faqs.FAQ_DEV.html
|
||||
|
||||
Postgres is a widely-used nickname for PostgreSQL. It was the original
|
||||
name of the project at Berkeley and is strongly preferred over other
|
||||
nicknames. If you find 'PostgreSQL' hard to pronounce, call it
|
||||
'Postgres' instead.
|
||||
|
||||
1.2) Who controls PostgreSQL?
|
||||
|
||||
If you are looking for a PostgreSQL gatekeeper, central committee, or
|
||||
controlling company, give up --- there isn't one. We do have a core
|
||||
committee and CVS committers, but these groups are more for
|
||||
administrative purposes than control. The project is directed by the
|
||||
community of developers and users, which anyone can join. All you need
|
||||
to do is subscribe to the mailing lists and participate in the
|
||||
discussions. (See the Developer's FAQ for information on how to get
|
||||
involved in PostgreSQL development.)
|
||||
|
||||
1.3) What is the copyright of PostgreSQL?
|
||||
|
||||
PostgreSQL is distributed under the classic BSD license. Basically, it
|
||||
allows users to do anything they want with the code, including
|
||||
reselling binaries without the source code. The only restriction is
|
||||
that you not hold us legally liable for problems with the software.
|
||||
There is also the requirement that this copyright appear in all copies
|
||||
of the software. Here is the actual BSD license we use:
|
||||
|
||||
PostgreSQL Data Base Management System
|
||||
|
||||
Portions Copyright (c) 1996-2009, PostgreSQL Global Development Group
|
||||
Portions Copyright (c) 1994-1996 Regents of the University of
|
||||
California
|
||||
|
||||
Permission to use, copy, modify, and distribute this software and its
|
||||
documentation for any purpose, without fee, and without a written
|
||||
agreement is hereby granted, provided that the above copyright notice
|
||||
and this paragraph and the following two paragraphs appear in all
|
||||
copies.
|
||||
|
||||
IN NO EVENT SHALL THE UNIVERSITY OF CALIFORNIA BE LIABLE TO ANY PARTY
|
||||
FOR DIRECT, INDIRECT, SPECIAL, INCIDENTAL, OR CONSEQUENTIAL DAMAGES,
|
||||
INCLUDING LOST PROFITS, ARISING OUT OF THE USE OF THIS SOFTWARE AND
|
||||
ITS DOCUMENTATION, EVEN IF THE UNIVERSITY OF CALIFORNIA HAS BEEN
|
||||
ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
|
||||
|
||||
THE UNIVERSITY OF CALIFORNIA SPECIFICALLY DISCLAIMS ANY WARRANTIES,
|
||||
INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF
|
||||
MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE. THE SOFTWARE
|
||||
PROVIDED HEREUNDER IS ON AN "AS IS" BASIS, AND THE UNIVERSITY OF
|
||||
CALIFORNIA HAS NO OBLIGATIONS TO PROVIDE MAINTENANCE, SUPPORT,
|
||||
UPDATES, ENHANCEMENTS, OR MODIFICATIONS.
|
||||
|
||||
1.4) What platforms does PostgreSQL support?
|
||||
|
||||
In general, any modern Unix-compatible platform should be able to run
|
||||
PostgreSQL. The platforms that had received explicit testing at the
|
||||
time of release are listed in the installation instructions.
|
||||
|
||||
PostgreSQL also runs natively on Microsoft Windows NT-based operating
|
||||
systems like Win2000 SP4, WinXP, and Win2003. A prepackaged installer
|
||||
is available at http://www.postgresql.org/download/windows.
|
||||
MSDOS-based versions of Windows (Win95, Win98, WinMe) can run
|
||||
PostgreSQL using Cygwin.
|
||||
|
||||
There is also a Novell Netware 6 port at
|
||||
http://developer.novell.com/wiki/index.php/Postgresql, and an OS/2
|
||||
(eComStation) version at
|
||||
http://hobbes.nmsu.edu/cgi-bin/h-search?sh=1&button=Search&key=postgre
|
||||
SQL&stype=all&sort=type&dir=%2F.
|
||||
|
||||
1.5) Where can I get PostgreSQL?
|
||||
|
||||
Via web browser, use http://www.postgresql.org/ftp/, and via ftp, use
|
||||
ftp://ftp.postgresql.org/pub/.
|
||||
|
||||
1.6) What is the most recent release?
|
||||
|
||||
The latest release of PostgreSQL is version 8.3.3.
|
||||
|
||||
We plan to have a major release every year, with minor releases every
|
||||
few months.
|
||||
|
||||
1.7) Where can I get support?
|
||||
|
||||
The PostgreSQL community provides assistance to many of its users via
|
||||
email. The main web site to subscribe to the email lists is
|
||||
http://www.postgresql.org/community/lists/. The general or bugs lists
|
||||
are a good place to start.
|
||||
|
||||
The major IRC channel is #postgresql on Freenode (irc.freenode.net).
|
||||
To connect you can use the Unix program irc -c '#postgresql' "$USER"
|
||||
irc.freenode.net or use any other IRC clients. A Spanish one also
|
||||
exists on the same network, (#postgresql-es), a French one,
|
||||
(#postgresqlfr), and a Brazilian one, (#postgresql-br). There is also
|
||||
a PostgreSQL channel on EFNet.
|
||||
|
||||
A list of commercial support companies is available at
|
||||
http://www.postgresql.org/support/professional_support.
|
||||
|
||||
1.8) How do I submit a bug report?
|
||||
|
||||
Visit the PostgreSQL bug form at
|
||||
http://www.postgresql.org/support/submitbug. Also check out our ftp
|
||||
site ftp://ftp.postgresql.org/pub/ to see if there is a more recent
|
||||
PostgreSQL version.
|
||||
|
||||
Bugs submitted using the bug form or posted to any PostgreSQL mailing
|
||||
list typically generates one of the following replies:
|
||||
* It is not a bug, and why
|
||||
* It is a known bug and is already on the TODO list
|
||||
* The bug has been fixed in the current release
|
||||
* The bug has been fixed but is not packaged yet in an official
|
||||
release
|
||||
* A request is made for more detailed information:
|
||||
+ Operating system
|
||||
+ PostgreSQL version
|
||||
+ Reproducible test case
|
||||
+ Debugging information
|
||||
+ Debugger backtrace output
|
||||
* The bug is new. The following might happen:
|
||||
+ A patch is created and will be included in the next major or
|
||||
minor release
|
||||
+ The bug cannot be fixed immediately and is added to the TODO
|
||||
list
|
||||
|
||||
1.9) How do I find out about known bugs or missing features?
|
||||
|
||||
PostgreSQL supports an extended subset of SQL:2003. See our TODO list
|
||||
for known bugs, missing features, and future plans.
|
||||
|
||||
A feature request usually results in one of the following replies:
|
||||
* The feature is already on the TODO list
|
||||
* The feature is not desired because:
|
||||
+ It duplicates existing functionality that already follows the
|
||||
SQL standard
|
||||
+ The feature would increase code complexity but add little
|
||||
benefit
|
||||
+ The feature would be insecure or unreliable
|
||||
* The new feature is added to the TODO list
|
||||
|
||||
PostgreSQL does not use a bug tracking system because we find it more
|
||||
efficient to respond directly to email and keep the TODO list
|
||||
up-to-date. In practice, bugs don't last very long in the software,
|
||||
and bugs that affect a large number of users are fixed rapidly. The
|
||||
only place to find all changes, improvements, and fixes in a
|
||||
PostgreSQL release is to read the CVS log messages. Even the release
|
||||
notes do not list every change made to the software.
|
||||
|
||||
1.10) What documentation is available?
|
||||
|
||||
PostgreSQL includes extensive documentation, including a large manual,
|
||||
manual pages, and some test examples. See the /doc directory. You can
|
||||
also browse the manuals online at http://www.postgresql.org/docs.
|
||||
|
||||
There are two PostgreSQL books available online at
|
||||
http://www.postgresql.org/docs/books/awbook.html and
|
||||
http://www.commandprompt.com/ppbook/. There are a number of PostgreSQL
|
||||
books available for purchase. One of the most popular ones is by Korry
|
||||
Douglas. A list of book reviews can be found at
|
||||
http://www.postgresql.org/docs/books/. There is also a collection of
|
||||
PostgreSQL technical articles at
|
||||
http://wiki.postgresql.org/wiki/Community_Generated_Articles%2C_Guides
|
||||
%2C_and_Documentation.
|
||||
|
||||
The command line client program psql has some \d commands to show
|
||||
information about types, operators, functions, aggregates, etc. - use
|
||||
\? to display the available commands.
|
||||
|
||||
Our web site contains even more documentation.
|
||||
|
||||
1.11) How can I learn SQL?
|
||||
|
||||
First, consider the PostgreSQL-specific books mentioned above. Many of
|
||||
our users also like The Practical SQL Handbook, Bowman, Judith S., et
|
||||
al., Addison-Wesley. Others like The Complete Reference SQL, Groff et
|
||||
al., McGraw-Hill.
|
||||
|
||||
There are also many nice tutorials available online:
|
||||
* http://www.intermedia.net/support/sql/sqltut.shtm
|
||||
* http://sqlcourse.com
|
||||
* http://www.w3schools.com/sql/default.asp
|
||||
* http://mysite.verizon.net/Graeme_Birchall/id1.html
|
||||
* http://sqlzoo.net
|
||||
|
||||
1.12) How do I submit a patch or join the development team?
|
||||
|
||||
See the Developer's FAQ.
|
||||
|
||||
1.13) How does PostgreSQL compare to other DBMSs? Can PostgreSQL be embedded?
|
||||
|
||||
There are several ways of measuring software: features, performance,
|
||||
reliability, support, and price.
|
||||
|
||||
Features
|
||||
PostgreSQL has most features present in large commercial DBMSs,
|
||||
like transactions, subselects, triggers, views, foreign key
|
||||
referential integrity, and sophisticated locking. We have some
|
||||
features they do not have, like user-defined types,
|
||||
inheritance, rules, and multi-version concurrency control to
|
||||
reduce lock contention.
|
||||
|
||||
Performance
|
||||
PostgreSQL's performance is comparable to other commercial and
|
||||
open source databases. It is faster for some things, slower for
|
||||
others. Our performance is usually +/-10% compared to other
|
||||
databases.
|
||||
|
||||
Reliability
|
||||
We realize that a DBMS must be reliable, or it is worthless. We
|
||||
strive to release well-tested, stable code that has a minimum
|
||||
of bugs. Each release has at least one month of beta testing,
|
||||
and our release history shows that we can provide stable, solid
|
||||
releases that are ready for production use. We believe we
|
||||
compare favorably to other database software in this area.
|
||||
|
||||
Support
|
||||
Our mailing lists provide contact with a large group of
|
||||
developers and users to help resolve any problems encountered.
|
||||
While we cannot guarantee a fix, commercial DBMSs do not always
|
||||
supply a fix either. Direct access to developers, the user
|
||||
community, manuals, and the source code often make PostgreSQL
|
||||
support superior to other DBMSs. There is commercial
|
||||
per-incident support available for those who need it. (See FAQ
|
||||
section 1.7.)
|
||||
|
||||
Price
|
||||
We are free for all use, both commercial and non-commercial.
|
||||
You can add our code to your product with no limitations,
|
||||
except those outlined in our BSD-style license stated above.
|
||||
|
||||
PostgreSQL is designed as a client/server architecture, which requires
|
||||
separate processes for each client and server, and various helper
|
||||
processes. Many embedded architectures can support such requirements.
|
||||
However, if your embedded architecture requires the database server to
|
||||
run inside the application process, you cannot use Postgres and should
|
||||
select a lighter-weight database solution.
|
||||
|
||||
1.14) Will PostgreSQL handle recent daylight saving time changes in various
|
||||
countries?
|
||||
|
||||
USA daylight saving time changes are included in PostgreSQL release
|
||||
8.0.[4+], and all later major releases, e.g. 8.1. Canada and Western
|
||||
Australia changes are included in 8.0.[10+], 8.1.[6+], and all later
|
||||
major releases. PostgreSQL releases prior to 8.0 use the operating
|
||||
system's timezone database for daylight saving information.
|
||||
|
||||
1.15) How do I unsubscribe from the PostgreSQL email lists? How do I avoid
|
||||
receiving duplicate emails?
|
||||
|
||||
The PostgreSQL Majordomo page allows subscribing or unsubscribing from
|
||||
any of the PostgreSQL email lists. (You might need to have your
|
||||
Majordomo password emailed to you to log in.)
|
||||
|
||||
All PostgreSQL email lists are configured so a group reply goes to the
|
||||
email list and the original email author. This is done so users
|
||||
receive the quickest possible email replies. If you would prefer not
|
||||
to receive duplicate email from the list in cases where you already
|
||||
receive an email directly, check eliminatecc from the Majordomo Change
|
||||
Settings page. You can also prevent yourself from receiving copies of
|
||||
emails you post to the lists by unchecking selfcopy.
|
||||
_________________________________________________________________
|
||||
|
||||
User Client Questions
|
||||
|
||||
2.1) What interfaces are available for PostgreSQL?
|
||||
|
||||
The PostgreSQL install includes only the C and embedded C interfaces.
|
||||
All other interfaces are independent projects that are downloaded
|
||||
separately; being separate allows them to have their own release
|
||||
schedule and development teams.
|
||||
|
||||
Some programming languages like PHP include an interface to
|
||||
PostgreSQL. Interfaces for languages like Perl, TCL, Python, and many
|
||||
others are available at http://pgfoundry.org.
|
||||
|
||||
2.2) What tools are available for using PostgreSQL with Web pages?
|
||||
|
||||
A nice introduction to Database-backed Web pages can be seen at:
|
||||
http://www.webreview.com
|
||||
|
||||
For Web integration, PHP (http://www.php.net) is an excellent
|
||||
interface.
|
||||
|
||||
For complex cases, many use the Perl and DBD::Pg with CGI.pm or
|
||||
mod_perl.
|
||||
|
||||
2.3) Does PostgreSQL have a graphical user interface?
|
||||
|
||||
There are a large number of GUI Tools that are available for
|
||||
PostgreSQL from both commercial and open source developers. A detailed
|
||||
list can be found in the Community Guide to PostgreSQL GUI Tools.
|
||||
_________________________________________________________________
|
||||
|
||||
Administrative Questions
|
||||
|
||||
3.1) How do I install PostgreSQL somewhere other than /usr/local/pgsql?
|
||||
|
||||
Specify the --prefix option when running configure.
|
||||
|
||||
3.2) How do I control connections from other hosts?
|
||||
|
||||
By default, PostgreSQL only allows connections from the local machine
|
||||
using Unix domain sockets or TCP/IP connections. Other machines will
|
||||
not be able to connect unless you modify listen_addresses in the
|
||||
postgresql.conf file, enable host-based authentication by modifying
|
||||
the $PGDATA/pg_hba.conf file, and restart the database server.
|
||||
|
||||
3.3) How do I tune the database engine for better performance?
|
||||
|
||||
There are three major areas for potential performance improvement:
|
||||
|
||||
Query Changes
|
||||
This involves modifying queries to obtain better performance:
|
||||
|
||||
+ Creation of indexes, including expression and partial indexes
|
||||
+ Use of COPY instead of multiple INSERTs
|
||||
+ Grouping of multiple statements into a single transaction to
|
||||
reduce commit overhead
|
||||
+ Use of CLUSTER when retrieving many rows from an index
|
||||
+ Use of LIMIT for returning a subset of a query's output
|
||||
+ Use of Prepared queries
|
||||
+ Use of ANALYZE to maintain accurate optimizer statistics
|
||||
+ Regular use of VACUUM or pg_autovacuum
|
||||
+ Dropping of indexes during large data changes
|
||||
|
||||
Server Configuration
|
||||
A number of postgresql.conf settings affect performance. For
|
||||
more details, see Administration Guide/Server Run-time
|
||||
Environment/Run-time Configuration for a full listing, and for
|
||||
commentary see
|
||||
http://www.varlena.com/varlena/GeneralBits/Tidbits/annotated_co
|
||||
nf_e.html and
|
||||
http://www.varlena.com/varlena/GeneralBits/Tidbits/perf.html.
|
||||
|
||||
Hardware Selection
|
||||
The effect of hardware on performance is detailed in
|
||||
http://www.powerpostgresql.com/PerfList/ and
|
||||
http://momjian.us/main/writings/pgsql/hw_performance/index.html
|
||||
.
|
||||
|
||||
3.4) What debugging features are available?
|
||||
|
||||
There are many log_* server configuration variables at
|
||||
http://www.postgresql.org/docs/current/interactive/runtime-config-logg
|
||||
ing.html that enable printing of query and process statistics which
|
||||
can be very useful for debugging and performance measurements.
|
||||
|
||||
3.5) Why do I get "Sorry, too many clients" when trying to connect?
|
||||
|
||||
You have reached the default limit of 100 database sessions. You need
|
||||
to increase the server's limit on how many concurrent backend
|
||||
processes it can start by changing the max_connections value in
|
||||
postgresql.conf and restarting the server.
|
||||
|
||||
3.6) What is the upgrade process for PostgreSQL?
|
||||
|
||||
See http://www.postgresql.org/support/versioning for a general
|
||||
discussion about upgrading, and
|
||||
http://www.postgresql.org/docs/current/static/install-upgrading.html
|
||||
for specific instructions.
|
||||
|
||||
3.7) What computer hardware should I use?
|
||||
|
||||
Because PC hardware is mostly compatible, people tend to believe that
|
||||
all PC hardware is of equal quality. It is not. ECC RAM, SCSI, and
|
||||
quality motherboards are more reliable and have better performance
|
||||
than less expensive hardware. PostgreSQL will run on almost any
|
||||
hardware, but if reliability and performance are important it is wise
|
||||
to research your hardware options thoroughly. A disk controller with a
|
||||
battery-backed cache is also useful. Our email lists can be used to
|
||||
discuss hardware options and tradeoffs.
|
||||
_________________________________________________________________
|
||||
|
||||
Operational Questions
|
||||
|
||||
4.1) How do I SELECT only the first few rows of a query? A random row?
|
||||
|
||||
To retrieve only a few rows, if you know at the number of rows needed
|
||||
at the time of the SELECT use LIMIT . If an index matches the ORDER BY
|
||||
it is possible the entire query does not have to be executed. If you
|
||||
don't know the number of rows at SELECT time, use a cursor and FETCH.
|
||||
|
||||
To SELECT a random row, use:
|
||||
SELECT col
|
||||
FROM tab
|
||||
ORDER BY random()
|
||||
LIMIT 1;
|
||||
|
||||
4.2) How do I find out what tables, indexes, databases, and users are
|
||||
defined? How do I see the queries used by psql to display them?
|
||||
|
||||
Use the \dt command to see tables in psql. For a complete list of
|
||||
commands inside psql you can use \?. Alternatively you can read the
|
||||
source code for psql in file pgsql/src/bin/psql/describe.c, it
|
||||
contains SQL commands that generate the output for psql's backslash
|
||||
commands. You can also start psql with the -E option so it will print
|
||||
out the queries it uses to execute the commands you give. PostgreSQL
|
||||
also provides an SQL compliant INFORMATION SCHEMA interface you can
|
||||
query to get information about the database.
|
||||
|
||||
There are also system tables beginning with pg_ that describe these
|
||||
too.
|
||||
|
||||
Use psql -l will list all databases.
|
||||
|
||||
Also try the file pgsql/src/tutorial/syscat.source. It illustrates
|
||||
many of the SELECTs needed to get information from the database system
|
||||
tables.
|
||||
|
||||
4.3) How do you change a column's data type?
|
||||
|
||||
Changing the data type of a column can be done easily in 8.0 and later
|
||||
with ALTER TABLE ALTER COLUMN TYPE.
|
||||
|
||||
In earlier releases, do this:
|
||||
BEGIN;
|
||||
ALTER TABLE tab ADD COLUMN new_col new_data_type;
|
||||
UPDATE tab SET new_col = CAST(old_col AS new_data_type);
|
||||
ALTER TABLE tab DROP COLUMN old_col;
|
||||
COMMIT;
|
||||
|
||||
You might then want to do VACUUM FULL tab to reclaim the disk space
|
||||
used by the expired rows.
|
||||
|
||||
4.4) What is the maximum size for a row, a table, and a database?
|
||||
|
||||
These are the limits:
|
||||
|
||||
Maximum size for a database? unlimited (32 TB databases exist)
|
||||
Maximum size for a table? 32 TB
|
||||
Maximum size for a row? 400 GB
|
||||
Maximum size for a field? 1 GB
|
||||
Maximum number of rows in a table? unlimited
|
||||
Maximum number of columns in a table? 250-1600 depending on column
|
||||
types
|
||||
Maximum number of indexes on a table? unlimited
|
||||
|
||||
Of course, these are not actually unlimited, but limited to available
|
||||
disk space and memory/swap space. Performance may suffer when these
|
||||
values get unusually large.
|
||||
|
||||
The maximum table size of 32 TB does not require large file support
|
||||
from the operating system. Large tables are stored as multiple 1 GB
|
||||
files so file system size limits are not important.
|
||||
|
||||
The maximum table size, row size, and maximum number of columns can be
|
||||
quadrupled by increasing the default block size to 32k. The maximum
|
||||
table size can also be increased using table partitioning.
|
||||
|
||||
One limitation is that indexes can not be created on columns longer
|
||||
than about 2,000 characters. Fortunately, such indexes are rarely
|
||||
needed. Uniqueness is best guaranteed by a function index of an MD5
|
||||
hash of the long column, and full text indexing allows for searching
|
||||
of words within the column.
|
||||
|
||||
4.5) How much database disk space is required to store data from a typical
|
||||
text file?
|
||||
|
||||
A PostgreSQL database may require up to five times the disk space to
|
||||
store data from a text file.
|
||||
|
||||
As an example, consider a file of 100,000 lines with an integer and
|
||||
text description on each line. Suppose the text string avergages
|
||||
twenty bytes in length. The flat file would be 2.8 MB. The size of the
|
||||
PostgreSQL database file containing this data can be estimated as 5.2
|
||||
MB:
|
||||
24 bytes: each row header (approximate)
|
||||
24 bytes: one int field and one text field
|
||||
+ 4 bytes: pointer on page to tuple
|
||||
----------------------------------------
|
||||
52 bytes per row
|
||||
|
||||
The data page size in PostgreSQL is 8192 bytes (8 KB), so:
|
||||
|
||||
8192 bytes per page
|
||||
------------------- = 158 rows per database page (rounded down)
|
||||
52 bytes per row
|
||||
|
||||
100000 data rows
|
||||
-------------------- = 633 database pages (rounded up)
|
||||
158 rows per page
|
||||
|
||||
633 database pages * 8192 bytes per page = 5,185,536 bytes (5.2 MB)
|
||||
|
||||
Indexes do not require as much overhead, but do contain the data that
|
||||
is being indexed, so they can be large also.
|
||||
|
||||
NULLs are stored as bitmaps, so they use very little space.
|
||||
|
||||
4.6) Why are my queries slow? Why don't they use my indexes?
|
||||
|
||||
Indexes are not used by every query. Indexes are used only if the
|
||||
table is larger than a minimum size, and the query selects only a
|
||||
small percentage of the rows in the table. This is because the random
|
||||
disk access caused by an index scan can be slower than a straight read
|
||||
through the table, or sequential scan.
|
||||
|
||||
To determine if an index should be used, PostgreSQL must have
|
||||
statistics about the table. These statistics are collected using
|
||||
VACUUM ANALYZE, or simply ANALYZE. Using statistics, the optimizer
|
||||
knows how many rows are in the table, and can better determine if
|
||||
indexes should be used. Statistics are also valuable in determining
|
||||
optimal join order and join methods. Statistics collection should be
|
||||
performed periodically as the contents of the table change.
|
||||
|
||||
Indexes are normally not used for ORDER BY or to perform joins. A
|
||||
sequential scan followed by an explicit sort is usually faster than an
|
||||
index scan of a large table. However, LIMIT combined with ORDER BY
|
||||
often will use an index because only a small portion of the table is
|
||||
returned.
|
||||
|
||||
If you believe the optimizer is incorrect in choosing a sequential
|
||||
scan, use SET enable_seqscan TO 'off' and run query again to see if an
|
||||
index scan is indeed faster.
|
||||
|
||||
When using wild-card operators such as LIKE or ~, indexes can only be
|
||||
used in certain circumstances:
|
||||
* The beginning of the search string must be anchored to the start
|
||||
of the string, i.e.
|
||||
+ LIKE patterns must not start with %.
|
||||
+ ~ (regular expression) patterns must start with ^.
|
||||
* The search string can not start with a character class, e.g.
|
||||
[a-e].
|
||||
* Case-insensitive searches such as ILIKE and ~* do not utilize
|
||||
indexes. Instead, use expression indexes, which are described in
|
||||
section 4.8.
|
||||
* The default C locale must be used during initdb because it is not
|
||||
possible to know the next-greatest character in a non-C locale.
|
||||
You can create a special text_pattern_ops index for such cases
|
||||
that work only for LIKE indexing. It is also possible to use full
|
||||
text indexing for word searches.
|
||||
|
||||
4.7) How do I see how the query optimizer is evaluating my query?
|
||||
|
||||
See the EXPLAIN manual page.
|
||||
|
||||
4.8) How do I perform regular expression searches and case-insensitive
|
||||
regular expression searches? How do I use an index for case-insensitive
|
||||
searches?
|
||||
|
||||
The ~ operator does regular expression matching, and ~* does
|
||||
case-insensitive regular expression matching. The case-insensitive
|
||||
variant of LIKE is called ILIKE.
|
||||
|
||||
Case-insensitive equality comparisons are normally expressed as:
|
||||
SELECT *
|
||||
FROM tab
|
||||
WHERE lower(col) = 'abc';
|
||||
|
||||
This will not use an standard index. However, if you create an
|
||||
expression index, it will be used:
|
||||
CREATE INDEX tabindex ON tab (lower(col));
|
||||
|
||||
If the above index is created as UNIQUE, though the column can store
|
||||
upper and lowercase characters, it can not have identical values that
|
||||
differ only in case. To force a particular case to be stored in the
|
||||
column, use a CHECK constraint or a trigger.
|
||||
|
||||
4.9) In a query, how do I detect if a field is NULL? How do I concatenate
|
||||
possible NULLs? How can I sort on whether a field is NULL or not?
|
||||
|
||||
You test the column with IS NULL and IS NOT NULL, like this:
|
||||
SELECT *
|
||||
FROM tab
|
||||
WHERE col IS NULL;
|
||||
|
||||
To concatentate with possible NULLs, use COALESCE(), like this:
|
||||
SELECT COALESCE(col1, '') || COALESCE(col2, '')
|
||||
FROM tab
|
||||
|
||||
To sort by the NULL status, use the IS NULL and IS NOT NULL modifiers
|
||||
in your ORDER BY clause. Things that are true will sort higher than
|
||||
things that are false, so the following will put NULL entries at the
|
||||
top of the resulting list:
|
||||
SELECT *
|
||||
FROM tab
|
||||
ORDER BY (col IS NOT NULL)
|
||||
|
||||
4.10) What is the difference between the various character types?
|
||||
|
||||
Type Internal Name Notes
|
||||
VARCHAR(n) varchar size specifies maximum length, no padding
|
||||
CHAR(n) bpchar blank padded to the specified fixed length
|
||||
TEXT text no specific upper limit on length
|
||||
BYTEA bytea variable-length byte array (null-byte safe)
|
||||
"char" char one character
|
||||
|
||||
You will see the internal name when examining system catalogs and in
|
||||
some error messages.
|
||||
|
||||
The first four types above are "varlena" types (i.e., the first four
|
||||
bytes on disk are the length, followed by the data). Thus the actual
|
||||
space used is slightly greater than the declared size. However, long
|
||||
values are also subject to compression, so the space on disk might
|
||||
also be less than expected.
|
||||
VARCHAR(n) is best when storing variable-length strings and it limits
|
||||
how long a string can be. TEXT is for strings of unlimited length,
|
||||
with a maximum of one gigabyte.
|
||||
|
||||
CHAR(n) is for storing strings that are all the same length. CHAR(n)
|
||||
pads with blanks to the specified length, while VARCHAR(n) only stores
|
||||
the characters supplied. BYTEA is for storing binary data,
|
||||
particularly values that include NULL bytes. All the types described
|
||||
here have similar performance characteristics.
|
||||
|
||||
4.11.1) How do I create a serial/auto-incrementing field?
|
||||
|
||||
PostgreSQL supports a SERIAL data type. It auto-creates a sequence.
|
||||
For example, this:
|
||||
CREATE TABLE person (
|
||||
id SERIAL,
|
||||
name TEXT
|
||||
);
|
||||
|
||||
is automatically translated into this:
|
||||
CREATE SEQUENCE person_id_seq;
|
||||
CREATE TABLE person (
|
||||
id INT4 NOT NULL DEFAULT nextval('person_id_seq'),
|
||||
name TEXT
|
||||
);
|
||||
|
||||
Automatically created sequence are named <table>_<serialcolumn>_seq,
|
||||
where table and serialcolumn are the names of the table and SERIAL
|
||||
column, respectively. See the create_sequence manual page for more
|
||||
information about sequences.
|
||||
|
||||
4.11.2) How do I get the value of a SERIAL insert?
|
||||
|
||||
The simplest way is to retrieve the assigned SERIAL value with
|
||||
RETURNING. Using the example table in 4.11.1, it would look like this:
|
||||
INSERT INTO person (name) VALUES ('Blaise Pascal') RETURNING id;
|
||||
|
||||
You can also call nextval() and use that value in the INSERT, or call
|
||||
currval() after the INSERT.
|
||||
|
||||
4.11.3) Doesn't currval() lead to a race condition with other users?
|
||||
|
||||
No. currval() returns the current value assigned by your session, not
|
||||
by all sessions.
|
||||
|
||||
4.11.4) Why aren't my sequence numbers reused on transaction abort? Why are
|
||||
there gaps in the numbering of my sequence/SERIAL column?
|
||||
|
||||
To improve concurrency, sequence values are given out to running
|
||||
transactions as needed and are not locked until the transaction
|
||||
completes. This causes gaps in numbering from aborted transactions.
|
||||
|
||||
4.12) What is an OID? What is a CTID?
|
||||
|
||||
If a table is created WITH OIDS, each row gets a unique a OID. OIDs
|
||||
are automatically assigned unique 4-byte integers that are unique
|
||||
across the entire installation. However, they overflow at 4 billion,
|
||||
and then the OIDs start being duplicated. PostgreSQL uses OIDs to link
|
||||
its internal system tables together.
|
||||
|
||||
To uniquely number rows in user tables, it is best to use SERIAL
|
||||
rather than OIDs because SERIAL sequences are unique only within a
|
||||
single table. and are therefore less likely to overflow. SERIAL8 is
|
||||
available for storing eight-byte sequence values.
|
||||
|
||||
CTIDs are used to identify specific physical rows with block and
|
||||
offset values. CTIDs change after rows are modified or reloaded. They
|
||||
are used by index entries to point to physical rows.
|
||||
|
||||
4.13) Why do I get the error "ERROR: Memory exhausted in AllocSetAlloc()"?
|
||||
|
||||
You probably have run out of virtual memory on your system, or your
|
||||
kernel has a low limit for certain resources. Try this before starting
|
||||
the server:
|
||||
ulimit -d 262144
|
||||
limit datasize 256m
|
||||
|
||||
Depending on your shell, only one of these may succeed, but it will
|
||||
set your process data segment limit much higher and perhaps allow the
|
||||
query to complete. This command applies to the current process, and
|
||||
all subprocesses created after the command is run. If you are having a
|
||||
problem with the SQL client because the backend is returning too much
|
||||
data, try it before starting the client.
|
||||
|
||||
4.14) How do I tell what PostgreSQL version I am running?
|
||||
|
||||
From psql, type SELECT version();
|
||||
|
||||
4.15) How do I create a column that will default to the current time?
|
||||
|
||||
Use CURRENT_TIMESTAMP:
|
||||
CREATE TABLE test (x int, modtime TIMESTAMP DEFAULT CURRENT_TIMESTAMP );
|
||||
|
||||
4.16) How do I perform an outer join?
|
||||
|
||||
PostgreSQL supports outer joins using the SQL standard syntax. Here
|
||||
are two examples:
|
||||
SELECT *
|
||||
FROM t1 LEFT OUTER JOIN t2 ON (t1.col = t2.col);
|
||||
|
||||
or
|
||||
SELECT *
|
||||
FROM t1 LEFT OUTER JOIN t2 USING (col);
|
||||
|
||||
These identical queries join t1.col to t2.col, and also return any
|
||||
unjoined rows in t1 (those with no match in t2). A RIGHT join would
|
||||
add unjoined rows of t2. A FULL join would return the matched rows
|
||||
plus all unjoined rows from t1 and t2. The word OUTER is optional and
|
||||
is assumed in LEFT, RIGHT, and FULL joins. Ordinary joins are called
|
||||
INNER joins.
|
||||
|
||||
4.17) How do I perform queries using multiple databases?
|
||||
|
||||
There is no way to query a database other than the current one.
|
||||
Because PostgreSQL loads database-specific system catalogs, it is
|
||||
uncertain how a cross-database query should even behave.
|
||||
|
||||
contrib/dblink allows cross-database queries using function calls. Of
|
||||
course, a client can also make simultaneous connections to different
|
||||
databases and merge the results on the client side.
|
||||
|
||||
4.18) How do I return multiple rows or columns from a function?
|
||||
|
||||
It is easy using set-returning functions,
|
||||
http://wiki.postgresql.org/wiki/Return_more_than_one_row_of_data_from_
|
||||
PL/pgSQL_functions.
|
||||
|
||||
4.19) Why do I get "relation with OID ##### does not exist" errors when
|
||||
accessing temporary tables in PL/PgSQL functions?
|
||||
|
||||
In PostgreSQL versions < 8.3, PL/PgSQL caches function scripts, and an
|
||||
unfortunate side effect is that if a PL/PgSQL function accesses a
|
||||
temporary table, and that table is later dropped and recreated, and
|
||||
the function called again, the function will fail because the cached
|
||||
function contents still point to the old temporary table. The solution
|
||||
is to use EXECUTE for temporary table access in PL/PgSQL. This will
|
||||
cause the query to be reparsed every time.
|
||||
|
||||
This problem does not occur in PostgreSQL 8.3 and later.
|
||||
|
||||
4.20) What replication solutions are available?
|
||||
|
||||
Though "replication" is a single term, there are several technologies
|
||||
for doing replication, with advantages and disadvantages for each.
|
||||
|
||||
Master/slave replication allows a single master to receive read/write
|
||||
queries, while slaves can only accept read/SELECT queries. The most
|
||||
popular freely available master-slave PostgreSQL replication solution
|
||||
is Slony-I.
|
||||
|
||||
Multi-master replication allows read/write queries to be sent to
|
||||
multiple replicated computers. This capability also has a severe
|
||||
impact on performance due to the need to synchronize changes between
|
||||
servers. PGCluster is the most popular such solution freely available
|
||||
for PostgreSQL.
|
||||
|
||||
There are also commercial and hardware-based replication solutions
|
||||
available supporting a variety of replication models.
|
||||
|
||||
4.21) Why are my table and column names not recognized in my query? Why is
|
||||
capitalization not preserved?
|
||||
|
||||
The most common cause of unrecognized names is the use of
|
||||
double-quotes around table or column names during table creation. When
|
||||
double-quotes are used, table and column names (called identifiers)
|
||||
are stored case-sensitive, meaning you must use double-quotes when
|
||||
referencing the names in a query. Some interfaces, like pgAdmin,
|
||||
automatically double-quote identifiers during table creation. So, for
|
||||
identifiers to be recognized, you must either:
|
||||
* Avoid double-quoting identifiers when creating tables
|
||||
* Use only lowercase characters in identifiers
|
||||
* Double-quote identifiers when referencing them in queries
|
@ -1,3 +0,0 @@
|
||||
The developer FAQ can be found on the PostgreSQL wiki:
|
||||
|
||||
http://wiki.postgresql.org/wiki/Development_information
|
1139
doc/src/FAQ/FAQ.html
1139
doc/src/FAQ/FAQ.html
File diff suppressed because it is too large
Load Diff
@ -1,14 +0,0 @@
|
||||
<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 3.2//EN">
|
||||
|
||||
<HTML>
|
||||
<HEAD>
|
||||
<TITLE>PostgreSQL Developers FAQ</TITLE>
|
||||
</HEAD>
|
||||
|
||||
<BODY bgcolor="#FFFFFF" text="#000000" link="#FF0000" vlink="#A00000"
|
||||
alink="#0000FF">
|
||||
<P>The developer FAQ can be found on the PostgreSQL wiki:</P>
|
||||
|
||||
<P><A href="http://wiki.postgresql.org/wiki/Development_information">http://wiki.postgresql.org/wiki/Development_information</A></P>
|
||||
</BODY>
|
||||
</HTML>
|
Loading…
Reference in New Issue
Block a user