Skip to content

Commit

Permalink
HADOOP-18876. ABFS: Change default for fs.azure.data.blocks.buffer to…
Browse files Browse the repository at this point in the history
… bytebuffer (#6009)


The default value for fs.azure.data.blocks.buffer is changed from "disk" to "bytebuffer"

This will speed up writing to azure storage, at the risk of running out of memory
-especially if there are many threads writing to abfs at the same time and the
upload bandwidth is limited.

If jobs do run out of memory writing to abfs, change the option back to "disk"

Contributed by Anmol Asrani
  • Loading branch information
anmolanmol1234 authored Oct 9, 2023
1 parent ea3cb12 commit 666af58
Show file tree
Hide file tree
Showing 2 changed files with 4 additions and 2 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -85,7 +85,7 @@ public final class ConfigurationKeys {
/**
* What data block buffer to use.
* <br>
* Options include: "disk"(Default), "array", and "bytebuffer".
* Options include: "disk", "array", and "bytebuffer"(Default).
* <br>
* Default is {@link FileSystemConfigurations#DATA_BLOCKS_BUFFER_DEFAULT}.
* Value: {@value}
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -132,11 +132,13 @@ public final class FileSystemConfigurations {
*/
public static final String DATA_BLOCKS_BUFFER_DISK = "disk";

public static final String DATA_BLOCKS_BYTEBUFFER = "bytebuffer";

/**
* Default buffer option: {@value}.
*/
public static final String DATA_BLOCKS_BUFFER_DEFAULT =
DATA_BLOCKS_BUFFER_DISK;
DATA_BLOCKS_BYTEBUFFER;

/**
* IO rate limit. Value: {@value}
Expand Down

0 comments on commit 666af58

Please sign in to comment.