YQ-2824: implement basis for type inference for CSV and TSV #4071

Merged
EgorkaZ merged 7 commits into ydb-platform:main from yq-type-inferencinator
Jun 7, 2024

@EgorkaZ EgorkaZ (Contributor) commented Apr 24, 2024

Changelog entry

...

Changelog category

  • Not for changelog (changelog entry is not required)

Additional information

...

github-actions bot commented Apr 24, 2024

2024-04-24 12:28:19 UTC Pre-commit check for e80faab has started.
2024-04-24 12:28:22 UTC Build linux-x86_64-release-asan is running...
🟢 2024-04-24 12:30:05 UTC Build successful.
2024-04-24 12:31:07 UTC Check cancelled

github-actions bot commented Apr 24, 2024

2024-04-24 12:28:30 UTC Pre-commit check for e80faab has started.
2024-04-24 12:28:32 UTC Build linux-x86_64-release-clang14 is running...
🟢 2024-04-24 12:30:10 UTC Build successful.

github-actions bot commented Apr 24, 2024

2024-04-24 12:30:47 UTC Pre-commit check for e80faab has started.
2024-04-24 12:30:49 UTC Build linux-x86_64-relwithdebinfo is running...
2024-04-24 12:31:07 UTC Check cancelled

@EgorkaZ EgorkaZ force-pushed the yq-type-inferencinator branch 3 times, most recently from d213493 to eb82764 Compare April 24, 2024 12:31

github-actions bot commented Apr 24, 2024

2024-04-24 12:33:13 UTC Pre-commit check for 6923166 has started.
2024-04-24 12:33:14 UTC Build linux-x86_64-relwithdebinfo is running...
🟢 2024-04-24 12:34:46 UTC Build successful.
2024-04-24 12:36:12 UTC Tests are running...
🟢 2024-04-24 12:36:32 UTC Tests successful.

Test history

TESTS PASSED ERRORS FAILED SKIPPED MUTED?
2 2 0 0 0 0

github-actions bot commented Apr 24, 2024

2024-04-24 12:35:03 UTC Pre-commit check for 6923166 has started.
2024-04-24 12:35:05 UTC Build linux-x86_64-release-clang14 is running...
🟢 2024-04-24 12:36:39 UTC Build successful.

github-actions bot commented Apr 24, 2024

2024-04-24 12:35:07 UTC Pre-commit check for 6923166 has started.
2024-04-24 12:35:08 UTC Build linux-x86_64-release-asan is running...
🟢 2024-04-24 12:36:34 UTC Build successful.
2024-04-24 12:37:58 UTC Tests are running...
🔴 2024-04-24 12:38:17 UTC Some tests failed, follow the links below.

Test history

TESTS PASSED ERRORS FAILED SKIPPED MUTED?
2 1 0 1 0 0

return InferCsvTypes(std::move(file), static_cast<const TsvConfig&>(config));
case EFileFormat::Undefined:
default:
return nullptr;
Collaborator

What exactly is returned here? Shouldn't an error be returned instead?
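
To make the question concrete, here is a minimal sketch of a format dispatch that reports an explicit error for an unsupported format instead of returning `nullptr`. Everything in it apart from `EFileFormat::Undefined` is an assumption made for illustration: the other enum values, `TInferredSchema`, `TInferenceError`, `TInferenceResult` and `InferTypes` are hypothetical stand-ins, not the PR's actual types.

```cpp
#include <string>
#include <variant>

// Hypothetical stand-ins: the PR's real config and result types are not shown in this thread.
enum class EFileFormat { Undefined, CsvWithNames, TsvWithNames };

struct TInferredSchema { std::string Description; };
struct TInferenceError { std::string Message; };

// Either a schema or an explicit error; the caller never has to check for nullptr.
using TInferenceResult = std::variant<TInferredSchema, TInferenceError>;

TInferenceResult InferTypes(EFileFormat format) {
    switch (format) {
    case EFileFormat::CsvWithNames:
        return TInferredSchema{"csv"};
    case EFileFormat::TsvWithNames:
        return TInferredSchema{"tsv"};
    case EFileFormat::Undefined:
    default:
        // Unsupported formats produce a descriptive error rather than a null result.
        return TInferenceError{"unsupported file format for type inference"};
    }
}
```

A caller can then branch on `std::holds_alternative<TInferenceError>(result)` and forward the message to the user.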

return false;
case arrow::Type::LARGE_LIST: // TODO: is it true?
case arrow::Type::FIXED_SIZE_LIST: // TODO: is it true?
case arrow::Type::LIST: { // TODO: is ok?
Collaborator

Our parsing doesn't support containers, so type inference shouldn't support them either.
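
One way to read this suggestion is as a check that accepts scalar kinds and rejects every list-like kind outright. The sketch below is only an illustration: `EColumnKind` is a hypothetical stand-in for the relevant `arrow::Type` ids (so that the example compiles without Arrow), and `IsSupportedForInference` is an assumed name, not a function from the PR.

```cpp
// Hypothetical stand-in for the subset of arrow::Type ids discussed above.
enum class EColumnKind { Int64, Double, String, List, LargeList, FixedSizeList };

// Returns true only for scalar kinds; container kinds are rejected,
// mirroring the parser, which does not support them.
bool IsSupportedForInference(EColumnKind kind) {
    switch (kind) {
    case EColumnKind::Int64:
    case EColumnKind::Double:
    case EColumnKind::String:
        return true;
    case EColumnKind::List:
    case EColumnKind::LargeList:
    case EColumnKind::FixedSizeList:
        return false;
    }
    return false; // Unknown kinds are treated as unsupported.
}
```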

ydb/core/external_sources/object_storage/inference/ya.make (outdated, resolved)
ydb/core/external_sources/object_storage/inference/ya.make (outdated, resolved)
@EgorkaZ EgorkaZ force-pushed the yq-type-inferencinator branch from eb82764 to b95f1c6 Compare May 6, 2024 08:51
@EgorkaZ EgorkaZ requested a review from a team as a code owner May 6, 2024 08:51
@EgorkaZ EgorkaZ force-pushed the yq-type-inferencinator branch from b95f1c6 to 51b9802 Compare May 30, 2024 17:25
@EgorkaZ EgorkaZ requested a review from a team as a code owner May 30, 2024 17:25

github-actions bot commented May 30, 2024

2024-05-30 17:26:57 UTC Pre-commit check for 948658e has started.
2024-05-30 17:29:21 UTC Build linux-x86_64-release-clang14 is running...
🔴 2024-05-30 17:36:40 UTC Build failed. see the build logs.

github-actions bot commented May 30, 2024

2024-05-30 17:29:22 UTC Pre-commit check for 948658e has started.
2024-05-30 17:31:57 UTC Build linux-x86_64-relwithdebinfo is running...
🔴 2024-05-30 17:39:58 UTC Build failed. see the build logs.
🔴 2024-05-30 17:39:59 UTC Tests run skipped.

github-actions bot commented May 30, 2024

2024-05-30 17:29:25 UTC Pre-commit check for 948658e has started.
2024-05-30 17:32:00 UTC Build linux-x86_64-release-asan is running...
🔴 2024-05-30 17:40:30 UTC Build failed. see the build logs.
🔴 2024-05-30 17:40:31 UTC Tests run skipped.

github-actions bot commented May 31, 2024

2024-05-31 09:55:21 UTC Pre-commit check for 0ffe1b0 has started.
2024-05-31 09:58:01 UTC Build linux-x86_64-release-clang14 is running...
🟢 2024-05-31 10:35:11 UTC Build successful.

github-actions bot commented May 31, 2024

2024-05-31 09:55:27 UTC Pre-commit check for 0ffe1b0 has started.
2024-05-31 09:57:59 UTC Build linux-x86_64-release-asan is running...
🟢 2024-05-31 10:25:57 UTC Build successful.
2024-05-31 10:26:09 UTC Tests are running...
🔴 2024-05-31 12:00:33 UTC Some tests failed, follow the links below.

Test history

TESTS PASSED ERRORS FAILED SKIPPED MUTED?
9479 9389 0 41 32 17

github-actions bot commented May 31, 2024

2024-05-31 09:55:28 UTC Pre-commit check for 0ffe1b0 has started.
2024-05-31 09:58:10 UTC Build linux-x86_64-relwithdebinfo is running...
🟢 2024-05-31 10:29:35 UTC Build successful.
2024-05-31 10:29:48 UTC Tests are running...
🔴 2024-05-31 12:00:49 UTC Some tests failed, follow the links below.

Test history

TESTS PASSED ERRORS FAILED SKIPPED MUTED?
37033 31272 0 960 4787 14

@@ -250,7 +250,7 @@ int TTpchCommandInit::Run(TConfig& config) {
TString notNull = "";
TString createExternalDataSource;
TString external;
- TString partitioning = "AUTO_PARTITIONING_MIN_PARTITIONS_COUNT";
+ TString partitioning = ", AUTO_PARTITIONING_MIN_PARTITIONS_COUNT";
Collaborator

This looks like it belongs in a separate review; it probably ended up here by accident.

@@ -2,7 +2,7 @@

#include "yql_http_gateway.h"

- #include <curl/curl.h>
+ #include <contrib/libs/curl/include/curl/curl.h>
Collaborator

Did this break for you locally?

Contributor Author

I think so.

In general, I don't see <curl/curl.h> used anywhere else outside contrib/.

}

NYql::TS3Credentials::TAuthInfo authInfo{};
if (std::holds_alternative<NAuth::TAws>(meta->Auth)) {
Collaborator

Where are the other auth types handled?

Contributor Author

Added service account. None doesn't need to be handled, and external source doesn't support the rest.
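
For readers following this exchange, the sketch below shows one way to make the handling of every auth alternative explicit, using `std::visit` instead of a chain of `std::holds_alternative` checks. It is not the PR's code: `TNone`, `TAws`, `TServiceAccount`, `TBasic`, `TAuth` and `DescribeAuth` are hypothetical names, and the set of alternatives is assumed.

```cpp
#include <string>
#include <variant>

// Hypothetical auth alternatives; the real NAuth types are not shown in this thread.
struct TNone {};
struct TAws { std::string AccessKey; std::string SecretKey; };
struct TServiceAccount { std::string Id; };
struct TBasic { std::string Login; };
using TAuth = std::variant<TNone, TAws, TServiceAccount, TBasic>;

// Standard C++17 helper that combines several lambdas into one visitor.
template <class... Ts> struct TOverloaded : Ts... { using Ts::operator()...; };
template <class... Ts> TOverloaded(Ts...) -> TOverloaded<Ts...>;

// Every alternative needs a handler: adding a new auth type without one
// fails to compile instead of being silently skipped.
std::string DescribeAuth(const TAuth& auth) {
    return std::visit(TOverloaded{
        [](const TNone&) { return std::string("anonymous"); },
        [](const TAws& aws) { return "aws:" + aws.AccessKey; },
        [](const TServiceAccount& sa) { return "service-account:" + sa.Id; },
        [](const TBasic&) { return std::string("unsupported"); }
    }, auth);
}
```

The design point is only that the unsupported cases are spelled out rather than falling through implicitly.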

ydb/core/external_sources/object_storage.cpp (outdated, resolved)
ydb/core/external_sources/object_storage.cpp (outdated, resolved)
ydb/core/external_sources/object_storage.cpp (resolved)
ydb/core/external_sources/object_storage/events.h (outdated, resolved)
@@ -265,12 +268,68 @@ struct TObjectStorageExternalSource : public IExternalSource {

virtual NThreading::TFuture<std::shared_ptr<TMetadata>> LoadDynamicMetadata(std::shared_ptr<TMetadata> meta) override {
Y_UNUSED(ActorSystem);
auto format = meta->Attributes.FindPtr("format");
Collaborator

It looks like compression isn't taken into account here yet. That can be supported in a separate review.

ydb/core/external_sources/object_storage.cpp (outdated, resolved)
@EgorkaZ EgorkaZ force-pushed the yq-type-inferencinator branch from 23df6d0 to cc35d35 Compare June 3, 2024 16:39

github-actions bot commented Jun 3, 2024

2024-06-03 16:42:35 UTC Pre-commit check for 3bdee6a has started.
2024-06-03 16:45:29 UTC Build linux-x86_64-release-clang14 is running...
2024-06-03 17:03:01 UTC Check cancelled

github-actions bot commented Jun 3, 2024

2024-06-03 16:42:50 UTC Pre-commit check for 3bdee6a has started.
2024-06-03 16:45:39 UTC Build linux-x86_64-release-asan is running...
2024-06-03 17:03:00 UTC Check cancelled

github-actions bot commented Jun 3, 2024

2024-06-03 16:42:56 UTC Pre-commit check for 3bdee6a has started.
2024-06-03 16:45:46 UTC Build linux-x86_64-relwithdebinfo is running...
2024-06-03 17:03:01 UTC Check cancelled

@EgorkaZ EgorkaZ force-pushed the yq-type-inferencinator branch from cc35d35 to 860091e Compare June 3, 2024 17:02

github-actions bot commented Jun 5, 2024

2024-06-05 17:06:07 UTC Pre-commit check for b15befe has started.
2024-06-05 17:08:49 UTC Build linux-x86_64-relwithdebinfo is running...
🟢 2024-06-05 17:48:45 UTC Build successful.
2024-06-05 17:49:14 UTC Tests are running...
🔴 2024-06-05 19:30:16 UTC Some tests failed, follow the links below.

Test history

TESTS PASSED ERRORS FAILED SKIPPED MUTED?
38619 33799 0 8 4803 9

github-actions bot commented Jun 5, 2024

2024-06-05 17:09:46 UTC Pre-commit check for b15befe has started.
2024-06-05 17:12:37 UTC Build linux-x86_64-release-asan is running...
🟢 2024-06-05 17:53:19 UTC Build successful.
2024-06-05 17:53:33 UTC Tests are running...
🔴 2024-06-05 19:53:47 UTC Some tests failed, follow the links below.

Test history

TESTS PASSED ERRORS FAILED SKIPPED MUTED?
10977 10907 0 29 28 13

dorooleg previously approved these changes Jun 6, 2024

github-actions bot commented Jun 7, 2024

2024-06-07 08:31:37 UTC Pre-commit check for 275db51 has started.
2024-06-07 08:34:27 UTC Build linux-x86_64-release-clang14 is running...
🟢 2024-06-07 09:10:36 UTC Build successful.

github-actions bot commented Jun 7, 2024

2024-06-07 08:31:48 UTC Pre-commit check for 275db51 has started.
2024-06-07 08:34:36 UTC Build linux-x86_64-relwithdebinfo is running...
🟢 2024-06-07 09:11:44 UTC Build successful.
2024-06-07 09:11:59 UTC Tests are running...
🔴 2024-06-07 10:50:55 UTC Some tests failed, follow the links below.

Test history

TESTS PASSED ERRORS FAILED SKIPPED MUTED?
38706 33859 0 2 4837 8

github-actions bot commented Jun 7, 2024

2024-06-07 08:32:07 UTC Pre-commit check for 275db51 has started.
2024-06-07 08:35:02 UTC Build linux-x86_64-release-asan is running...
🟢 2024-06-07 09:18:08 UTC Build successful.
2024-06-07 09:18:24 UTC Tests are running...
🔴 2024-06-07 11:19:37 UTC Some tests failed, follow the links below.

Test history

TESTS PASSED ERRORS FAILED SKIPPED MUTED?
10972 10899 0 38 27 8

@dorooleg dorooleg removed the request for review from a team June 7, 2024 09:02
@EgorkaZ EgorkaZ merged commit c3cc29f into ydb-platform:main Jun 7, 2024
8 of 12 checks passed
@niksaveliev niksaveliev mentioned this pull request Jun 17, 2024
@StekPerepolnen StekPerepolnen mentioned this pull request Jun 19, 2024
3 participants