Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[fix](hdfs) Fix hdfsExists that return staled root cause #27991

Merged
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
19 changes: 16 additions & 3 deletions be/src/io/fs/hdfs_file_system.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -227,15 +227,28 @@ Status HdfsFileSystem::delete_internal(const Path& path, int is_recursive) {
Status HdfsFileSystem::exists_impl(const Path& path, bool* res) const {
CHECK_HDFS_HANDLE(_fs_handle);
Path real_path = convert_path(path, _fs_name);
#ifdef USE_HADOOP_HDFS
// HACK: the HDFS native client won't clear the last exception as expected so
// `hdfsGetLastExceptionRootCause` might return a staled root cause. Save the
// last root cause here and verify after hdfsExists returns a non-zero code.
//
// See details:
// https://github.com/apache/hadoop/blob/5cda162a804fb0cfc2a5ac0058ab407662c5fb00/
// hadoop-hdfs-project/hadoop-hdfs-native-client/src/main/native/libhdfs/jni_helper.c#L795
char* former_root_cause = hdfsGetLastExceptionRootCause();
#endif
int is_exists = hdfsExists(_fs_handle->hdfs_fs, real_path.string().c_str());
#ifdef USE_HADOOP_HDFS
// when calling hdfsExists() and return non-zero code,
// if root_cause is nullptr, which means the file does not exist.
// if root_cause is not nullptr, which means it encounter other error, should return.
// NOTE: not for libhdfs3 since it only runs on MaxOS, don't have to support it.
char* root_cause = hdfsGetLastExceptionRootCause();
if (root_cause != nullptr) {
return Status::IOError("failed to check path existence {}: {}", path.native(), root_cause);
if (is_exists != 0) {
char* root_cause = hdfsGetLastExceptionRootCause();
if (root_cause != nullptr && root_cause != former_root_cause) {
return Status::IOError("failed to check path existence {}: {}", path.native(),
root_cause);
}
}
#endif
*res = (is_exists == 0);
Expand Down