In the simplest terms, a user-defined function (UDF) in SQL Server is a programming construct that accepts parameters, does work that typically makes use of the accepted parameters, and returns a type of result. This article will cover two types of UDFs: table-valued and scalar-valued.

Types of UDFs

Table-valued functions
A table-valued UDF is a function that accepts parameters and returns the results in the form of a table. This type of function is special because it returns a table that you can query the results of and join with other tables. In SQL Server 2005, field values from other tables may be passed into the function during a join operation to return a record based result. To accomplish this, you must use SQL Server 2005's APPLY operator.

It can be difficult to know when it is appropriate to use a VIEW vs. when it is appropriate to use a table-valued UDF. VIEWs are a great tool for data abstraction, combining data, and logically using subsets of data. I like to use table-valued UDFs when I need to use one or more values from different tables in a join operation where some type of calculation needs to be done and an aggregation returned.

Scalar-valued functions
A scalar-valued UDF accepts parameters and, ultimately, returns a single, atomic value. There are seven reasons why these types of functions are different than stored procedures in the database engine.

|> You cannot modify data inside of a UDF.
|> A scalar-valued UDF returns only one value, where a stored procedure can have numerous OUTPUT parameters.
|> You can use scalar-valued UDFs as the default value for a column in a table.
|> Scalar-valued UDFs are an easy way to define constant values to use in your database environment.
|> You can pass field values as parameters into UDFs.
|> You can nest scalar function calls. This means that you can pass a call to a scalar-valued function to another function or stored procedure.
|> You can use the results from scalar-valued UDFs as criteria in a WHERE statement. Although you can do it, this is typically not a good idea. (Later in the article, I'll explain why I try to avoid this common practice.)

There are two types of scalar-valued UDFs: deterministic and non-deterministic. Recognising the determinism of the functions that are created is important. An example of the importance is the creation of indexed views. One of the many restrictions of creating an index on a view is that the view definition cannot use a non-deterministic function.

Deterministic
A deterministic UDF always returns the same result with the same set of input parameters. Some examples of deterministic functions are the system functions MONTH(), YEAR(), and ISNULL().

Non-deterministic
A non-deterministic UDF is can potentially return a different value each time it is called with the same set of input parameters. Some examples of non-deterministic functions are the system functions GETDATE(), NEWID(), and @@CONNECTIONS.

Two examples of UDFs

Before presenting the examples, I will set up my SalesHistory table and load data into it:

IF OBJECT_ID('SalesHistory')>0
DROP TABLE SalesHistory;

CREATE TABLE [dbo].[SalesHistory]
(
[SaleID] [int] IDENTITY(1,1) NOT NULL PRIMARY KEY,
[Product] [varchar](10) NULL,
[SaleDate] [datetime] NULL,
[SalePrice] [money] NULL
)

DECLARE @i SMALLINT
SET @i = 1

WHILE (@i <=1000)
BEGIN

INSERT INTO SalesHistory(Product, SaleDate, SalePrice)
VALUES ('Computer', DATEADD(mm, @i, '3/11/1919'), DATEPART(ms, GETDATE()) + (@i + 57))

INSERT INTO SalesHistory(Product, SaleDate, SalePrice)
VALUES('BigScreen', DATEADD(mm, @i, '3/11/1927'), DATEPART(ms, GETDATE()) + (@i + 13))

INSERT INTO SalesHistory(Product, SaleDate, SalePrice)
VALUES('PoolTable', DATEADD(mm, @i, '3/11/1908'), DATEPART(ms, GETDATE()) + (@i + 29))

SET @i = @i + 1
END

GO

The first UDF I will look at is the scalar-valued UDF. The script below defines a function named dbo.udf_GetProductSales that accepts three parameters and returns a MONEY value. The function uses the three input parameters as criteria in calculating the total sales from the SalesHistory table.

CREATE FUNCTION dbo.udf_GetProductSales
(
@Product VARCHAR(10),
@BeginDate DATETIME,
@EndDate DATETIME
)
RETURNS MONEY
AS
BEGIN
DECLARE @Sales MONEY

SELECT @Sales = SUM(SalePrice)
FROM SalesHistory
WHERE
Product = @Product AND
SaleDate BETWEEN @BeginDate AND @EndDate

RETURN(@Sales)
END

The script below calls the UDF created in the above script. Note: The schema the function belongs to must be used in the call. In this case, the function belongs to the dbo schema.

SELECT dbo.udf_GetProductSales('PoolTable', '1/1/1990', '1/1/2000')


I usually discourage using scalar-valued UDFs in a WHERE criteria statement because, for every record considered in the query, the scalar-valued function will be called. This means that a function used in the WHERE criteria will cause a scan of the values being searched, which is going to be slower than if an index is able to be used. (I will provide more details on this concept in a future article.)


Although the use of a correlated sub-query is sometimes confusing and complicated, the use of them can help solve some of the more challenging query problems. While using these special queries is useful, they only return one column of data. You can use the upgraded table-valued UDFs in SQL Server 2005 to overcome this shortcoming. I'll show you how to use the APPLY operator to accept column values from a table and return a table-result of correlated values.


CREATE FUNCTION dbo.udf_GetProductSalesTable
(
@Product VARCHAR(10),
@SaleID INT
)
RETURNS @SalesTable TABLE
(
SalesTotal MONEY,
SalesCount INT
)

BEGIN

INSERT INTO @SalesTable(SalesTotal, SalesCount)
SELECT
SUM(SalePrice), COUNT(SaleID)
FROM
SalesHistory
WHERE
Product = @Product AND
SaleID <= @SaleID

RETURN
END
GO

The above function accepts the particular product for which we were searching, along with the SaleID from the SalesHistory table. From the function definition, you can see that the function returns a table named @SalesTable that contains two columns: SalesTotal and SalesCount. The body of the function inserts aggregate values into the @SalesTable table variable based upon the input parameters.

The following code uses the APPLY operator to invoke the table-valued function with the values from the SalesHistory table. (Note: Logically, you may want to use a JOIN operator here, but it is not necessary. The APPLY operator essentially does the "JOIN" for us by applying the values from the SalesHistory table to the table-valued function. In a sense, this code works the same way a correlated sub-query does, except that it can return multiple correlated values.)

SELECT * FROM SalesHistory sh
CROSS APPLY dbo.udf_GetProductSalesTable(sh.Product, sh.SaleID)
ORDER BY sh.SaleID ASC

Database Jumping This was published in Database Jumping, check every Thursday for more stories

Comments

1

Juan - 13/12/07

Modified table-value function for performance.

/* Original code */
CREATE FUNCTION dbo.udf_GetProductSalesTable
(
@Product VARCHAR(10),
@SaleID INT
)
RETURNS @SalesTable TABLE
(
SalesTotal MONEY,
SalesCount INT
)

BEGIN

INSERT INTO @SalesTable(SalesTotal, SalesCount)
SELECT
SUM(SalePrice), COUNT(SaleID)
FROM
SalesHistory
WHERE
Product = @Product AND
SaleID <= @SaleID

RETURN
END
GO

/* Modified code */
CREATE FUNCTION dbo.udf_GetProductSalesTable
(
@Product VARCHAR(10),
@SaleID INT
)
RETURNS TABLE
AS
RETURN
(
SELECT
SUM(SalePrice), COUNT(SaleID)
FROM
SalesHistory
WHERE
Product = @Product AND
SaleID <= @SaleID
)
GO

» Report offensive content

2

Luke - 13/12/07

A caveat: in SQL Server 2000, UDF performance fell dramatically (like, two orders of magnitude) when the table size exceeded ~200 records. I don't know if that's true in SQL 2005, too, but it's something to be aware of before relying on UDFs to return large data sets.

» Report offensive content

3

Mark - 01/01/08

Luke, please clarify. I have successfully used table-valued functions which return thousands of rows, which I then join with other tables in the FROM clause, much like I would have used a view. I can, however, see that using a UDF in the WHERE or SELECT clause would cause serious performance issues. Which kind are you referring to here?

» Report offensive content

4

Fred - 31/05/08

About the use of UDFs in where clauses: they're not always bad; it all depends on where in the where clause they are used ;-) that is, care should be taken not to prevent SQL using indexes, but apart of that, for what I know, UDF's in where clauses are fine.

for example suppose a table of events with an index on their times:

(A) good: select * from events where events.t = getdate()
(B) bad: select * from events where myudf(events.t) = xxx

(A) is OK because getdate() is computed once, and then used as search key using the index on t.
(B) is bad because the server cannot use the index any more and has to compute myudf(events.t) on each entry, before it finds one that matches xxx.

You probably already know that, but I think it's worth pointing out.

» Report offensive content

5

Fred - 31/05/08

(of course, my examples were hastily written; you would be lucky if query (A) returned anything, and getdate() is not an UDF; just replace getdate() by some UDF.)

» Report offensive content

Leave a comment

You must read and type the 6 chars within 0..9 and A..F

* indicates mandatory fields.

5

Fred - 31/05/08

(of course, my examples were hastily written; you would be lucky if query (A) returned anything, and getdate() is not an ... more

4

Fred - 31/05/08

About the use of UDFs in where clauses: they're not always bad; it all depends on where in the where clause ... more

3

Mark - 01/01/08

Luke, please clarify. I have successfully used table-valued functions which return thousands of rows, which I then join with other ... more

Log in


Sign up | Forgot your password?

  • Staff Microsoft shows off IE9 preview

    This week, highlights from Microsoft's MIX10 conference and more in the Roundup. Read more »

    -- posted by Staff

  • Chris Duckett IE9's H.264 vote killed Ogg

    In a split decision by the judges, the winner of the W3C/WHATWG video codec consensus is H.264, taking home the future of video playback on the internet while loser Ogg goes home with nothing but thoughts of what might have been. Read more »

    -- posted by Chris Duckett

  • Staff Google launches Apps Marketplace

    Google launches and app store, while Mozilla plans to re-write its open-source license. More of this week's news in the Roundup. Read more »

    -- posted by Staff

What's on?

  • Optus Deal

    Broadband + home phone + PlayStation®3 in a single package price!