spike mode for dynamic series adjustment #70

yomek33 · 2024-07-30T19:25:34Z

Add spike mode for metrics generation and cycling. Other two modes: #64

spike

usage
./avalanche --series-operation-mode=spike --series-change-interval=180 --series-count=100 --spike-multiplier=1.5
This mode periodically increases the series count by a spike multiplier on one tick and then returns it to the original count on the next tick. This pattern repeats indefinitely.

yomek33 · 2024-07-30T20:24:13Z

Currently, the implementation alternates between the original series count and the spiked series count at each seriesChangeInterval. This approach was chosen for its simplicity. However, if we want to have the original series count for the first 3 minutes and then the spiked series count for the next 5 minutes, we could achieve this by adding a new flag. This would allow for more flexible and customized patterns for metrics generation. If this is not a concern, we can proceed with the current implementation.

cstyan

just some minor comments

and now that I think about it some more, we can probably combine this mode and double-halve, since this is essentially double-halve with a spike-multiplier of 2

cstyan · 2024-08-01T00:46:35Z

metrics/serve.go

+		if inSpike {
+			inSpike = false
+			*currentSeriesCount = initialSeriesCount
+		} else {
+			inSpike = true
+			*currentSeriesCount = int(float64(initialSeriesCount) * spikeMultiplier)
+		}


you could simplify this by just checking

if *currentSeriescount > initialSeriesCount { set to initial series count continue } multiply by the spike multiplier

cstyan · 2024-08-01T00:47:13Z

metrics/serve.go

+		select {
+		case updateNotify <- struct{}{}:
+		default:


not sure what's happening here, and looking at the other handle functions we're doing the same, what do we need this select and the update notify channel for?

the updateNotify channel are used here to:

avalanche/metrics/write.go

Lines 140 to 149 in 4b732e1

select {

case <-c.config.UpdateNotify:

log.Println("updating remote write metrics")

tss, err = collectMetrics()

if err != nil {

merr.Add(err)

}

default:

tss = updateTimetamps(tss)

}

Each time a notification is received, the metrics are updated and logged.

You're right! We learned something. It looks like updateNotify is only used for the write mode, where avalanche is remote writing somewhere. So this isn't needed for what we're trying to do with prombench, but it is required in general 👍

cstyan · 2024-08-02T22:05:46Z

might need to rebase or merge in master, looks like there might be a merge conflict?

Signed-off-by: yomek33 <[email protected]>

yomek33 · 2024-08-06T13:30:18Z

might need to rebase or merge in master, looks like there might be a merge conflict?

@cstyan I fixed!🙆‍♀️

Signed-off-by: yomek33 <[email protected]>

cstyan

just some minor comments, only one thing that should change for now IMO

cstyan · 2024-08-21T01:05:30Z

metrics/serve_test.go

+			assert.Equal(t, expectedCount, currentCount, "Halved series count should be %d but got %d", int(expectedCount), currentCount)
+		} else {
+			currentCount := countSeries(t, promRegistry)
+			expectedCount := initialSeriesCount * spikeMultiplier
+			assert.Equal(t, int(expectedCount), currentCount, "Doubled series count should be %d but got %d", int(expectedCount), float64(currentCount))


we should change the messages here, you could put "multiplied the series count by %d, should be %d but got %d" for example

cstyan · 2024-08-21T01:09:18Z

cmd/avalanche.go

@@ -91,7 +99,7 @@ func main() {

 	stop := make(chan struct{})
 	defer close(stop)
-	updateNotify, err := metrics.RunMetrics(*metricCount, *labelCount, *seriesCount, *seriesChangeRate, *maxSeriesCount, *minSeriesCount, *metricLength, *labelLength, *valueInterval, *labelInterval, *metricInterval, *seriesChangeInterval, *seriesOperationMode, *constLabels, stop)
+	updateNotify, err := metrics.RunMetrics(*metricCount, *labelCount, *seriesCount, *seriesChangeRate, *maxSeriesCount, *minSeriesCount, *metricLength, *labelLength, *valueInterval, *labelInterval, *metricInterval, *seriesChangeInterval, *spikeMultiplier, *seriesOperationMode, *constLabels, stop)


We don't need to do it now, but we should look at breaking RunMetrics into separate functions for each of the run modes. The current functions such as this new handleSpikeMode could create the other go routines that we create within RunMetrics currently.

This would mean we don't have to pass so many parameters around across a few functions.

Signed-off-by: yomek33 <[email protected]>

yomek33 force-pushed the series-spike branch from 9804cf3 to 045bfd5 Compare July 30, 2024 19:28

yomek33 marked this pull request as ready for review July 30, 2024 20:13

cstyan reviewed Aug 1, 2024

View reviewed changes

yomek33 force-pushed the series-spike branch from eaed89f to e4a5e41 Compare August 2, 2024 12:48

yomek33 added 2 commits August 6, 2024 22:28

feat: Add spike mode for dynamic series adjustment

47eeb45

Signed-off-by: yomek33 <[email protected]>

FIX logic better

84b0abb

Signed-off-by: yomek33 <[email protected]>

yomek33 force-pushed the series-spike branch from e4a5e41 to 84b0abb Compare August 6, 2024 13:29

yomek33 force-pushed the series-spike branch 3 times, most recently from 69d0f7d to 98823c6 Compare August 7, 2024 04:17

format

b7277dc

Signed-off-by: yomek33 <[email protected]>

yomek33 force-pushed the series-spike branch from 98823c6 to b7277dc Compare August 7, 2024 04:39

cstyan reviewed Aug 21, 2024

View reviewed changes

Improve text in TestRunMetricsSpikeChange

d4f9640

Signed-off-by: yomek33 <[email protected]>

cstyan merged commit 3558d56 into prometheus-community:main Aug 22, 2024
6 checks passed

yomek33 mentioned this pull request Aug 26, 2024

Refactor RunMetrics into Separate Functions for Each Run Mode #73

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

spike mode for dynamic series adjustment #70

spike mode for dynamic series adjustment #70

yomek33 commented Jul 30, 2024 •

edited

Loading

yomek33 commented Jul 30, 2024

cstyan left a comment

cstyan Aug 1, 2024

cstyan Aug 1, 2024

yomek33 Aug 1, 2024

cstyan Aug 2, 2024

cstyan commented Aug 2, 2024

yomek33 commented Aug 6, 2024

cstyan left a comment

cstyan Aug 21, 2024

yomek33 Aug 22, 2024

cstyan Aug 21, 2024

	select {
	case <-c.config.UpdateNotify:
	log.Println("updating remote write metrics")
	tss, err = collectMetrics()
	if err != nil {
	merr.Add(err)
	}
	default:
	tss = updateTimetamps(tss)
	}

spike mode for dynamic series adjustment #70

spike mode for dynamic series adjustment #70

Conversation

yomek33 commented Jul 30, 2024 • edited Loading

spike

yomek33 commented Jul 30, 2024

cstyan left a comment

Choose a reason for hiding this comment

cstyan Aug 1, 2024

Choose a reason for hiding this comment

cstyan Aug 1, 2024

Choose a reason for hiding this comment

yomek33 Aug 1, 2024

Choose a reason for hiding this comment

cstyan Aug 2, 2024

Choose a reason for hiding this comment

cstyan commented Aug 2, 2024

yomek33 commented Aug 6, 2024

cstyan left a comment

Choose a reason for hiding this comment

cstyan Aug 21, 2024

Choose a reason for hiding this comment

yomek33 Aug 22, 2024

Choose a reason for hiding this comment

cstyan Aug 21, 2024

Choose a reason for hiding this comment

yomek33 commented Jul 30, 2024 •

edited

Loading