Difference between revisions of "About Table Joins"
imported>Aeric |
imported>Aeric |
||
Line 52: | Line 52: | ||
</div> | </div> | ||
Here, the row-matching criteria is specified in the <tt>ON</tt> clause. | Here, the row-matching criteria is specified in the <tt>ON</tt> clause. A <tt>WHERE</tt> clause would also work, but <tt>ON</tt> is more traditional in this context, and is likely to perform better. {{TBD| Is that a true statement?}} | ||
==LEFT and RIGHT Joins== | ==LEFT and RIGHT Joins== |
Revision as of 01:37, 19 November 2011
In a SQL query, you can join multiple tables together. In effect, you make one large virtual table to SELECT rows from.
How a Join Works
Whenever you specify multiple tables in a SQL query, a join is implied. If you don't specify any other selection criteria, the result is the cartesian product of the rows in the individual tables. So if table Alpha has rows A and B, while table Beta has rows 1 and 2, then the query:
- SELECT * FROM alpha,beta
returns 4 rows:
- A + 1
- A + 2
- B + 1
- B + 2
In general then, a table join will return N*M rows, where N & M are the number of rows in each table, respectively.
Of course, you're rarely interested in all possible combinations of all rows. What you're really interested in are the rows where one of the columns in table Alpha matches one of the columns in table Beta. And in general, the matching data you're looking for will be specified by a Lookup relationship.
To make it more concrete, consider the Sample Order Processing System:
- The Orders object has a Lookup to Customers
- That relationship is created by a field in the Orders object, related_to_Customer that contains the record ID of a Customer record.
- A SQL Join returns the product of all records in both tables:
- Order 1 for Customer A + Customer A
- Order 1 for Customer A + Customer B
- Order 1 for Customer A + Customer C
- Order 2 for Customer A + Customer A
- Order 2 for Customer A + Customer B
- Order 2 for Customer A + Customer C
- ...
- Order 1 for Customer B + Customer A
- Order 1 for Customer B + Customer B
- Order 1 for Customer B + Customer C
- etc.
- The records we're going to care about (highlighted above) are the ones where the related_to_Customer field in the Orders record matches the record ID of a Customer record.
- That relationship is Order.related_to_customer = Customer.id
The only remaining refinement to that concept is that, when specifying multiple tables in a SQL query, table aliases are required, and you use those alias to specify fields. So a full query can look something like this:
- SELECT c.customer_name, o.created_date
- FROM Customers AS c, Orders AS o
- WHERE o.related_to_customer = c.id
- Note:
If the table is named "Order" rather than "Orders", the table name needs to be enclosed in backticks, since "ORDER" is a reserved word: `Order`.
INNER JOIN
An INNER Join is the default. It returns only those rows that match the ON condition. So these two queries produce the same results as the previous example:
- SELECT c.customer_name, o.created_date
- FROM Customers AS c INNER JOIN Orders AS o
- ON o.related_to_customer = c.id
- SELECT c.customer_name, o.created_date
- FROM Customers AS c JOIN Orders AS o
- ON o.related_to_customer = c.id
Here, the row-matching criteria is specified in the ON clause. A WHERE clause would also work, but ON is more traditional in this context, and is likely to perform better. __TBD: Is that a true statement?__
LEFT and RIGHT Joins
Sometimes you want to include rows that don't have a matching value. To do that, you use a LEFT or RIGHT join.
For example, suppose you want a list of customers and the orders they made last month. A simple query like the one shown above tells you which customers made orders, but it doesn't tell you which ones didn't make an order. Of course, you could issue a separate query to get that information, but you can also get all of the information in a single list, by issuing a LEFT or RIGHT join.
So for a query of Customers AS c, Orders AS o:
- A LEFT join includes all rows in the Customers table (the table on the left) that don't have a matching row on the right.
- A RIGHT join includes all rows in the Orders table (the table on the right) that don't have a matching row on the left.
For example:
- SELECT c.customer_name, o.created_date
- FROM Customers AS c LEFT JOIN Orders AS o
- ON o.related_to_customer = c.id