Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

dep(*): move code of sync_diff_inspector from tidb-tools to tiflow #11671

Merged
merged 23 commits into from
Jan 7, 2025
Merged
Show file tree
Hide file tree
Changes from 16 commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 2 additions & 1 deletion go.mod
Original file line number Diff line number Diff line change
Expand Up @@ -62,6 +62,7 @@ require (
github.com/mailru/easyjson v0.7.7
github.com/mattn/go-shellwords v1.0.12
github.com/modern-go/reflect2 v1.0.2
github.com/olekukonko/tablewriter v0.0.5
github.com/phayes/freeport v0.0.0-20180830031419-95f893ade6f2
github.com/pierrec/lz4/v4 v4.1.18
github.com/pingcap/check v0.0.0-20211026125417-57bd13f7b5f0
Expand Down Expand Up @@ -345,7 +346,7 @@ require (
github.com/shurcooL/httpfs v0.0.0-20230704072500-f1e31cf0ba5c // indirect
github.com/shurcooL/httpgzip v0.0.0-20190720172056-320755c1c1b0 // indirect
github.com/shurcooL/vfsgen v0.0.0-20200824052919-0d455de96546 // indirect
github.com/siddontang/go v0.0.0-20180604090527-bdc77568d726 // indirect
github.com/siddontang/go v0.0.0-20180604090527-bdc77568d726
joechenrh marked this conversation as resolved.
Show resolved Hide resolved
github.com/siddontang/go-log v0.0.0-20180807004314-8d05993dda07 // indirect
github.com/sirupsen/logrus v1.9.3 // indirect
github.com/spaolacci/murmur3 v1.1.0 // indirect
Expand Down
2 changes: 2 additions & 0 deletions go.sum
Original file line number Diff line number Diff line change
Expand Up @@ -830,6 +830,8 @@ github.com/niemeyer/pretty v0.0.0-20200227124842-a10e7caefd8e/go.mod h1:zD1mROLA
github.com/nxadm/tail v1.4.4/go.mod h1:kenIhsEOeOJmVchQTgglprH7qJGnHDVpk1VPCcaMI8A=
github.com/nxadm/tail v1.4.8 h1:nPr65rt6Y5JFSKQO7qToXr7pePgD6Gwiw05lkbyAQTE=
github.com/nxadm/tail v1.4.8/go.mod h1:+ncqLTQzXmGhMZNUePPaPqPvBxHAIsmXswZKocGu+AU=
github.com/olekukonko/tablewriter v0.0.5 h1:P2Ga83D34wi1o9J6Wh1mRuqd4mF/x/lgBS7N7AbDhec=
github.com/olekukonko/tablewriter v0.0.5/go.mod h1:hPp6KlRPjbx+hW8ykQs1w3UBbZlj6HuIJcUGPhkA7kY=
github.com/onsi/ginkgo v1.6.0/go.mod h1:lLunBs/Ym6LB5Z9jYTR76FiuTmxDTDusOGeTQH+WWjE=
github.com/onsi/ginkgo v1.12.1/go.mod h1:zj2OWP4+oCPe1qIXoGWkgMRwljMUYCdkwsT2108oapk=
github.com/onsi/ginkgo v1.13.0/go.mod h1:+REjRxOmWfHCjfv9TTWB1jD1Frx4XydAD3zm1lskyM0=
Expand Down
23 changes: 23 additions & 0 deletions sync_diff_inspector/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,23 @@
# sync-diff-inspector

sync-diff-inspector is a tool for comparing two database's data.

## How to use

```shell
Usage of diff:
-V, --version print version of sync_diff_inspector
-L, --log-level string log level: debug, info, warn, error, fatal (default "info")
-C, --config string Config file
-T, --template string <dm|norm> export a template config file
--dm-addr string the address of DM
--dm-task string identifier of dm task
--check-thread-count int how many goroutines are created to check data (default 4)
--export-fix-sql set true if want to compare rows or set to false will only compare checksum (default true)
```

For more details you can read the [config.toml](./config/config.toml), [config_sharding.toml](./config/config_sharding.toml) and [config_dm.toml](./config/config_dm.toml).

## Documents
- `zh`: [Overview in Chinese](https://docs.pingcap.com/zh/tidb/stable/sync-diff-inspector-overview)
- `en`: [Overview in English](https://docs.pingcap.com/tidb/stable/sync-diff-inspector-overview)
251 changes: 251 additions & 0 deletions sync_diff_inspector/checkpoints/checkpoints.go
Original file line number Diff line number Diff line change
@@ -0,0 +1,251 @@
// Copyright 2021 PingCAP, Inc.
//
// Licensed under the Apache License, Version 2.0 (the "License");
// you may not use this file except in compliance with the License.
// You may obtain a copy of the License at
//
// http://www.apache.org/licenses/LICENSE-2.0
//
// Unless required by applicable law or agreed to in writing, software
// distributed under the License is distributed on an "AS IS" BASIS,
// See the License for the specific language governing permissions and
// limitations under the License.

package checkpoints

import (
"container/heap"
"context"
"encoding/json"
"os"
"sync"

"github.com/pingcap/errors"
"github.com/pingcap/log"
"github.com/pingcap/tiflow/sync_diff_inspector/chunk"
"github.com/pingcap/tiflow/sync_diff_inspector/config"
"github.com/pingcap/tiflow/sync_diff_inspector/report"
"github.com/siddontang/go/ioutil2"
"go.uber.org/zap"
)

const (
// SuccessState means
// for chunk: this chunk's data is equal
// for table: means this all chunk in this table is equal(except ignore chunk)
SuccessState = "success"

// FailedState means
// for chunk: this chunk's data is not equal
// for table: some chunks' data is not equal or some chunk check failed in this table
FailedState = "failed"

// IgnoreState means
// for chunk: this chunk is ignored. if it is Empty chunk, will ignore some chunk
// for table: don't have this state
IgnoreState = "ignore"
)

// Node is the struct for node
type Node struct {
State string `json:"state"` // indicate the state ("success" or "failed") of the chunk

ChunkRange *chunk.Range `json:"chunk-range"`
IndexID int64 `json:"index-id"`
}

// GetID returns id from the node
func (n *Node) GetID() *chunk.CID { return n.ChunkRange.Index }

// GetState returns the state from the node
func (n *Node) GetState() string { return n.State }

// GetTableIndex returns table index
func (n *Node) GetTableIndex() int { return n.ChunkRange.Index.TableIndex }

// GetBucketIndexLeft returns BucketIndexLeft
func (n *Node) GetBucketIndexLeft() int { return n.ChunkRange.Index.BucketIndexLeft }

// GetBucketIndexRight returns BucketIndexRight
func (n *Node) GetBucketIndexRight() int { return n.ChunkRange.Index.BucketIndexRight }

// GetChunkIndex returns ChunkIndex
func (n *Node) GetChunkIndex() int { return n.ChunkRange.Index.ChunkIndex }

// IsAdjacent represents whether the next node is adjacent node.
// it's the important logic for checkpoint update.
// we need keep this node save to checkpoint in global order.
func (n *Node) IsAdjacent(next *Node) bool {
if n.GetTableIndex() == next.GetTableIndex()-1 {
if n.ChunkRange.IsLastChunkForTable() && next.ChunkRange.IsFirstChunkForTable() {
return true
}
return false
}
if n.GetTableIndex() == next.GetTableIndex() {
// same table
if n.GetBucketIndexRight() == next.GetBucketIndexLeft()-1 {
if n.ChunkRange.IsLastChunkForBucket() && next.ChunkRange.IsFirstChunkForBucket() {
return true
}
return false
}
if n.GetBucketIndexLeft() == next.GetBucketIndexLeft() {
return n.GetChunkIndex() == next.GetChunkIndex()-1
}
return false
}
return false
}

// IsLess represents whether the cur node is less than next node.
func (n *Node) IsLess(next *Node) bool {
if n.GetTableIndex() < next.GetTableIndex() {
return true
}
if n.GetTableIndex() == next.GetTableIndex() {
if n.GetBucketIndexLeft() <= next.GetBucketIndexLeft()-1 {
return true
}
if n.GetBucketIndexLeft() == next.GetBucketIndexLeft() {
return n.GetChunkIndex() < next.GetChunkIndex()
}
return false
}
return false
}

// heap maintain a Min Heap, which can be accessed by multiple threads and protected by mutex.
type nodeHeap struct {
Nodes []*Node
CurrentSavedNode *Node // CurrentSavedNode save the minimum checker chunk, updated by `GetChunkSnapshot` method
mu *sync.Mutex // protect critical section
}

// Checkpoint provide the ability to restart the sync-diff process from the
// latest previous exit point (due to error or intention).
type Checkpoint struct {
hp *nodeHeap
}

// SavedState contains the information of the latest checked chunk and state of `report`
// When sync-diff start from the checkpoint, it will load this information and continue running
type SavedState struct {
Chunk *Node `json:"chunk-info"`
Report *report.Report `json:"report-info"`
}

// InitCurrentSavedID the method is only used in initialization without lock, be cautious
func (cp *Checkpoint) InitCurrentSavedID(n *Node) {
cp.hp.CurrentSavedNode = n
}

// GetCurrentSavedID returns the saved id with lock
func (cp *Checkpoint) GetCurrentSavedID() *Node {
cp.hp.mu.Lock()
defer cp.hp.mu.Unlock()
return cp.hp.CurrentSavedNode
}

// Insert inserts a new node
func (cp *Checkpoint) Insert(node *Node) {
cp.hp.mu.Lock()
heap.Push(cp.hp, node)
cp.hp.mu.Unlock()
}

// Len gets the length of the heap
func (hp *nodeHeap) Len() int { return len(hp.Nodes) }

// Less determines which is more priority than another
func (hp *nodeHeap) Less(i, j int) bool {
return hp.Nodes[i].IsLess(hp.Nodes[j])
}

// Swap implementation of swap for the heap interface
func (hp *nodeHeap) Swap(i, j int) {
hp.Nodes[i], hp.Nodes[j] = hp.Nodes[j], hp.Nodes[i]
}

// Push implementation of push for the heap interface
func (hp *nodeHeap) Push(x interface{}) {
hp.Nodes = append(hp.Nodes, x.(*Node))
}

// Pop implementation of pop for heap interface
func (hp *nodeHeap) Pop() (item interface{}) {
if len(hp.Nodes) == 0 {
return
}

hp.Nodes, item = hp.Nodes[:len(hp.Nodes)-1], hp.Nodes[len(hp.Nodes)-1]
return
}

// Init initialize the Checkpoint
func (cp *Checkpoint) Init() {
hp := &nodeHeap{
mu: &sync.Mutex{},
Nodes: make([]*Node, 0),
CurrentSavedNode: &Node{
ChunkRange: &chunk.Range{
Index: chunk.GetInitCID(),
IsFirst: true,
IsLast: true,
},
},
}
heap.Init(hp)
cp.hp = hp
}

// GetChunkSnapshot get the snapshot of the minimum continuous checked chunk
func (cp *Checkpoint) GetChunkSnapshot() (cur *Node) {
cp.hp.mu.Lock()
defer cp.hp.mu.Unlock()
for cp.hp.Len() != 0 && cp.hp.CurrentSavedNode.IsAdjacent(cp.hp.Nodes[0]) {
cp.hp.CurrentSavedNode = heap.Pop(cp.hp).(*Node)
cur = cp.hp.CurrentSavedNode
}
// wait for next 10s to check
return cur
}

// SaveChunk saves the chunk to file.
func (cp *Checkpoint) SaveChunk(ctx context.Context, fileName string, cur *Node, reportInfo *report.Report) (*chunk.CID, error) {
if cur == nil {
return nil, nil
}

savedState := &SavedState{
Chunk: cur,
Report: reportInfo,
}
checkpointData, err := json.Marshal(savedState)
if err != nil {
log.Warn("fail to save the chunk to the file", zap.Any("chunk index", cur.GetID()), zap.Error(err))
return nil, errors.Trace(err)
}

if err = ioutil2.WriteFileAtomic(fileName, checkpointData, config.LocalFilePerm); err != nil {
return nil, err
}
log.Info("save checkpoint",
zap.Any("chunk", cur),
zap.String("state", cur.GetState()))
return cur.GetID(), nil
}

// LoadChunk loads chunk info from file `chunk`
func (cp *Checkpoint) LoadChunk(fileName string) (*Node, *report.Report, error) {
bytes, err := os.ReadFile(fileName)
if err != nil {
return nil, nil, errors.Trace(err)
}
n := &SavedState{}
err = json.Unmarshal(bytes, n)
if err != nil {
return nil, nil, errors.Trace(err)
}
return n.Chunk, n.Report, nil
}
Loading
Loading