
Investigate Java 21 and Jruby compatibility #15342

Closed · 4 of 6 tasks
roaksoax opened this issue Sep 22, 2023 · 5 comments · Fixed by #15719

roaksoax commented Sep 22, 2023

Java 21 is now available and we would like to make it the default for Logstash. However, we first need to investigate whether that is possible, in particular whether JRuby supports it.

Deprecation list: https://docs.oracle.com/en/java/javase/21/docs/api/deprecated-list.html
Dependent tasks:

Depending tasks:

Other Tasks

andsel mentioned this issue Dec 21, 2023
andsel linked a pull request Dec 21, 2023 that will close this issue

andsel commented Jan 25, 2024

As reported in jruby/jruby#8061 (comment), JDK 21's LinkedHashMap introduces a new method (map) that is not present in JDK 17 and that interferes with JRuby's map method.
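
To make the interference concrete, here is a hypothetical JRuby snippet (my own illustration, not taken from the linked issue): JRuby normally lets Ruby's Enumerable#map run on java.util collections, and a Java-derived map method on the same class would shadow it. LinkedHashSet is used here because it is the class targeted by the workaround below.

require 'java'

set = java.util.LinkedHashSet.new
set.add(1)
set.add(2)

# Under JDK 17 this dispatches to Ruby's Enumerable#map and returns [2, 4];
# if the JVM class gains its own map method, the call no longer resolves to
# the Ruby block-taking version.
p set.map { |x| x * 2 }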


andsel commented Feb 8, 2024

As reported in jruby/jruby#8061 (comment), the fix will be included in JRuby 9.4.6.0.
The temporary workaround is to add

java.util.LinkedHashSet.remove_method(:map) rescue nil

to the rspec bootstrap script, i.e. the script containing:

require_relative "environment"
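
Putting the two fragments together, the top of the bootstrap script would look roughly like this (placing the guard before the require is my assumption, not something stated in the JRuby issue):

# Temporary workaround for the JDK 21 method clash: drop the Java-derived
# map method so JRuby's Enumerable#map keeps winning; the rescue makes this
# a no-op on JDKs where the method does not exist.
java.util.LinkedHashSet.remove_method(:map) rescue nil

require_relative "environment"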


andsel commented Mar 28, 2024

Analysis of the removal of the preventive GC flag for Logstash

Which problem preventive GC was intended to resolve

JDK 17 introduced the flag G1UsePreventiveGC to address a problem in G1 evacuation when there are many short-lived humongous objects (humongous meaning an object bigger than 1/2 of a region size). As discussed in https://tschatzl.github.io/2021/09/16/jdk17-g1-parallel-gc-changes.html, the problem shows up as 0 objects copied during the evacuation phase: the count of such objects rises so quickly that there are no Eden or Survivor regions available to move them to, so a Full GC (which is Stop The World) is needed to do in-place compaction.
The flag was introduced to run some preventive, unscheduled GC cycles before humongous objects saturate the humongous regions, essentially to preserve space for copying objects during evacuation and to avoid a Full GC.
With JDK 20 the flag was deprecated and defaulted to false; with JDK 21 it has been removed.
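
To make the humongous threshold concrete, here is a small JRuby sketch (the 4 MB allocation and the assumption of a 4 MB G1 region size are illustrative, not measurements from Logstash):

require 'java'

# A single 4 MB Java byte[] allocated from JRuby. With a G1 region size of
# 4 MB (or smaller), this object is >= half a region, so G1 places it in
# humongous regions instead of Eden.
chunk = Java::byte[4 * 1024 * 1024].new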

Elasticsearch use case

Elasticsearch data nodes load a lot of 4 MB byte[] chunks of data to be passed down to the ML node (this also happens in other cases, it is not limited to the ML case). This generates a lot of humongous allocations (humongous objects being objects with size >= 1/2 of the region size). In general such a spike in allocations would generate an OOM error in the JVM, but ES is able to protect against it with a circuit breaker, and that is exactly what showed up: a lot of circuit breaker exceptions, with memory staying high instead of being freed and kept lower by the G1 preventive collection phases.

How ES solved the issue
ES is resolving this by trying to allocate fewer humongous objects.
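
As a hedged illustration of that idea (hypothetical sizes and helper, not the actual ES change): splitting one large payload into slices that stay below the humongous threshold keeps each allocation inside a normal region.

CHUNK_SIZE = 256 * 1024  # hypothetical slice size, well under half a region

# Return the payload as an array of sub-threshold slices instead of one big object.
def slice_payload(payload, chunk_size = CHUNK_SIZE)
  (0...payload.bytesize).step(chunk_size).map do |offset|
    payload.byteslice(offset, chunk_size)
  end
end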

Logstash use case

Logstash has some peculiarities:

  • allocation is governed by the environment: clients push data into the inputs, or data is pulled in by the inputs.
  • there isn't any explicit circuit breaker to avoid memory exhaustion.
  • the limiting mechanism is the in-memory queue: if the upstream is going too fast, the queue acts as a bounding mechanism by blocking (see the sketch after this list).
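
A minimal Ruby sketch of that bounding behaviour, using SizedQueue as a stand-in for the in-memory pipeline queue (the queue size and the receive_event/process helpers are hypothetical):

queue = SizedQueue.new(128)     # bounded, like the in-memory pipeline queue

producer = Thread.new do
  loop do
    event = receive_event       # hypothetical input source
    queue.push(event)           # blocks once 128 events are in flight, which
                                # is what throttles a too-fast upstream
  end
end

consumer = Thread.new do
  loop { process(queue.pop) }   # hypothetical filter/output work
end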

Queue full case
If the queue is full and is limiting the input, then at a certain point the allocation rate is not high: the references sit in the queue for relatively long periods, so those objects likely transition into the tenured regions (old generation) and get no benefit from preventive GCs.

So from this perspective, having preventive GCs or not doesn't provide any improvement.

Queue empty and fast consumers
In this case the queue is almost empty: consumers are able to keep up with producers. When the allocation rate is high and the pipeline queues have enough space to keep all the events live (big objects >= 2MB), and given that there isn't any circuit breaker protection, preventive GCs offer only limited relief: without something preemptively limiting the allocation rate, the JVM hosting Logstash is destined to go OOM anyway.

Also in this case, having preventive GCs or not doesn't provide any improvement.

Considerations

Given the discussion above, preventive GCs don't play an important role in Logstash memory management.

How I ran some tests

Used the following pipeline, which is pretty fast and keeps the queue mostly empty:

input {
  http {
    response_headers => {"Content-Type" => "application/json"}
    ecs_compatibility => disabled
  }
}
output {
  sink {}
}  

Created a 4 MB file containing a single line of text.
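The input file can be produced with a one-liner like the following (the file name matches the one read by the Lua script below; the actual content used in the test is not recorded here, so a repeated character is assumed):

File.write("input_sample.txt", "a" * (4 * 1024 * 1024))
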
Ran wrk with the following Lua script (wrk_send_file.lua):

wrk.method = "POST"
local f = io.open("input_sample.txt", "r")
wrk.body = f:read("*all")

invoked as:

wrk --threads 4 --connections 12 -d10m -s wrk_send_file.lua --latency http://localhost:8080


andsel commented Apr 3, 2024

Reopening because this was inadvertently closed by #15719.

andsel reopened this Apr 3, 2024

roaksoax commented Apr 9, 2024

Closing this issue since Logstash will now support JDK 21. The discussion to decide whether we make it the default is being followed up in a different thread.

roaksoax closed this as completed Apr 9, 2024