karatejb: 11月 2020

2020年11月26日星期四

[SQL Server] Partition table

SQL Server Partition table

▌Introduction

This is a simple tutorial for creating partitions on an existing table in Microsoft SQL Server.

▌Environment

▋Microsoft SQL Server 2017 Express

▌Steps

▋Create a demo table

Lets create a table first. We will use [CreateOn] as the the value for partitioning.

CREATE TABLE [dbo].[OnlineTxs]
(
    [Id]       [numeric](18, 0) IDENTITY (1,1) NOT NULL,
    [CardNo]   [varchar](19)                   NOT NULL,
    [Amt]      [decimal](10, 0)                NOT NULL,
    [CreateOn] [datetime]                      NOT NULL,
    CONSTRAINT [PK_OnlineTxs] PRIMARY KEY NONCLUSTERED
    (
        [Id] ASC
    ) WITH (PAD_INDEX = OFF, STATISTICS_NORECOMPUTE = OFF, IGNORE_DUP_KEY = OFF, ALLOW_ROW_LOCKS = ON, ALLOW_PAGE_LOCKS = ON) ON [PRIMARY]
)

▋ Create Partition Function and Scheme

Use Date as the partition boundary of the Partition Function.

CREATE PARTITION FUNCTION pf_date_range (datetime)
AS RANGE RIGHT
FOR VALUES ('2020/11/23','2020/11/24')
-- FOR VALUES (CONVERT(VARCHAR(10), dateadd(day,-1, GETDATE()), 111)) -- Yesterday
GO

CREATE PARTITION SCHEME ps_date_range
    AS PARTITION pf_date_range ALL TO ([PRIMARY])
GO

Verify the partitions we just made by the following SQL.

SELECT ps.name,pf.name,boundary_id,value
FROM sys.partition_schemes ps
INNER JOIN sys.partition_functions pf ON pf.function_id=ps.function_id
INNER JOIN sys.partition_range_values prf ON pf.function_id=prf.function_id

The records will be stored in this way,

Partition number	Range
1	< 2020-11-23 00:00:00
2	>= 2020-11-23 00:00:00 & < 2020-11-24 00:00:00
3	>= 2020-11-24 00:00:00

▋ Create the none-clustered constraint and create clustered index

Furthermore, we have to recreate the constraint, the primary key must

• Includes the Partition Boundary column: [CreateOn]

• None-clustered

• Be added on the Partition Function

And create a clustered index for the Partition Boundary column: [CreateOn].

ALTER TABLE [dbo].[OnlineTxs] DROP CONSTRAINT [PK_OnlineTxs]
GO

ALTER TABLE [dbo].[OnlineTxs]
    ADD CONSTRAINT [PK_OnlineTxs] PRIMARY KEY NONCLUSTERED ([Id] ASC, [CreateOn])
    ON ps_date_range([CreateOn]);
GO

-- Create cluster index
CREATE CLUSTERED INDEX [IX_OnlineTxs_createon] ON [dbo].[OnlineTxs] ([CreateOn])
WITH (STATISTICS_NORECOMPUTE = OFF, IGNORE_DUP_KEY = OFF, ALLOW_ROW_LOCKS = ON, ALLOW_PAGE_LOCKS = ON)
ON ps_date_range([CreateOn])
GO

▋Test the partition table with data

We can use the following SQL to generate some data into the table.

CREATE FUNCTION fn_get_random (@GUID uniqueidentifier, @CEILING_VALUE INT)
RETURNS INT
AS
BEGIN
  DECLARE @RAND INT;
  SELECT @RAND = ABS(CHECKSUM(@GUID)) % @CEILING_VALUE;
  RETURN @RAND;
END

-- Create mock data
DECLARE @seq int = 0
WHILE @seq < 1000
BEGIN
    SET @seq = @seq + 1

    INSERT INTO [dbo].[OnlineTxs]([Cardno],[Amt],[CreateOn])
    SELECT '123456****789' AS [CardNo], dbo.fn_get_random(NEWID(), 9999) AS [Amt], DATEADD(day, -1 * dbo.fn_get_random(NEWID(),4), GETDATE()) AS [CreateOn]
END

Check the data by summarizing by Partition Number,

SELECT o.name AS [ObjectName],i.name AS [IndexName], partition_id AS [PartitionId], partition_number AS [PartitionNumber], [rows]
FROM sys.partitions p
INNER JOIN sys.objects o ON o.object_id=p.object_id
INNER JOIN sys.indexes i ON i.object_id=p.object_id and p.index_id=i.index_id
WHERE o.name LIKE '%OnlineTxs%'
ORDER BY [IndexName], [PartitionNumber]

Or list/count the records by Partition Number,

DECLARE @partition_number INT = 1;
SELECT COUNT(*) AS Cnt FROM [dbo].[OnlineTxs]
WHERE $PARTITION.pf_date_range([CreateOn]) = @partition_number;

▋Move on more actions

▋How to truncate by Partition

Truncating by Partition is much faster than doing deletion. Here is how to truncate rows in millionseconds by Partition Number.

DECLARE @partition_numer INT = 1;
TRUNCATE TABLE [dbo].[OnlineTxs] WITH (PARTITIONS(@partition_number));

▋How to create new Partition

Alter the Partition Function to have more Partitions,

ALTER PARTITION SCHEME ps_date_range NEXT USED [PRIMARY];
ALTER PARTITION FUNCTION pf_date_range() SPLIT RANGE ('2020/11/25');

▋How to remove the Partition

ALTER PARTITION FUNCTION pf_date_range() MERGE RANGE(CONVERT(DATETIME, '2020-11-23'));

Notice that the parameter for RANGE() must be DATETIME.

▋Remove Partition Scheme and Function

DROP PARTITION SCHEME ps_date_range;
DROP PARTITION FUNCTION pf_date_range;

▌Reference

▋How to Partition an existing SQL Server Table

2020年11月22日星期日

[Notes] 2020 Meet Taipei 創新創業嘉年華 VVIP Day

這是第一次參加 Meet Taipei 展覽，記錄一下幾個印象深刻的幾種軟體服務。

官網：https://meettaipei.tw/

報到後收到厚厚的導覽書。

線上線下整合

DEEP INSIGHT：整合零售的數據專家，在實體通路商店整合POS機並安裝Sensor，辨識熟客以及以歷史數據分析來做即時推薦和行銷。雖然這一塊很多大供應商都已卡位，不過對於台灣的市場來說，本地的軟硬體支援會更即時和符合當地文化，所以我認為這一塊還是相當有機會的。

Daione：自動化跨境口碑行銷工具，媒合日韓東南亞的網紅，來做一站式的網紅口碑行銷。這個點子相當有趣，由他們的介紹，還可以透過服務來觀察效益；在這個對於網紅，Youtuber 接受度相當高的新世代，未來可能會成為主力的行銷工具也不一定呢。
教育機器人

女媧創造 - 凱比機器人：這個機器人很可愛，操作介面是Android based，擁有對談，互動，教學的功能。由於在會場僅有聽業務介紹，沒有實際操作，過了幾天我還實際跑去三創一樓某電信商有實機可簡單試玩。我相信教育機器人會逐漸變成未來教育的輔助工具 !! 期望凱比越來越進步囉!
其他雲端服務

有線上管理訂單，訂餐，Email行銷，資產和投資管理（有的是媒介貸款 XDDD）等族繁不及備載，甚至還有媒介健身教練的服務。

你訂NIDIN 的現場人員非常熱情，演練了一段從 Web下單到POS機接單，再由標籤機印出飲料品項的流程。這類關鍵的差異性在於UI/UX以及整合客戶既有軟硬體的速度。由於我們辦公室還滿常使用NIDIN，也祝福他們市場越做越大囉!

一個下午下來，已經眼花撩亂啦! 萬事起頭難，成功沒有捷徑，建立客戶的信任，聯盟，安全的服務和適當的行銷，是邁向成功（或存活 XDD）的必要條件吧！

文章分類

2020年11月26日 星期四

[SQL Server] Partition table

2020年11月22日 星期日

[Notes] 2020 Meet Taipei 創新創業嘉年華 VVIP Day

2020年11月26日星期四

2020年11月22日星期日