chore: reducing coverage loss #619

jarulraj · 2023-03-20T04:38:12Z

Moving to a different pattern for handling exceptions: handle them once at the query executor instead of in each operator. See https://www.youtube.com/watch?v=8kTlzR4HhWo&list=PLqapmkczup-VyUEZW8adgVWX088cToWlX&index=4
Ref: https://www.reddit.com/r/learnpython/comments/yxcqib/exception_handling_best_practices/
Abstract tests:

https://github.com/georgia-tech-db/eva/blob/1d826440368682ce589735dba3cb87365d98d2b2/test/udfs/test_abstract_udf.py#L26-L40
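A minimal sketch of the pattern described above, assuming a much simplified executor. `ScanOperator` and `execute_plan` are hypothetical names for illustration and are not EVA's actual classes; only `ExecutorError` appears in this PR's diff. The operator checks its invariants with assert and has no try/except of its own, while the single entry point converts any failure into an `ExecutorError`.

```python
from typing import Iterator, List


class ExecutorError(Exception):
    """Raised by the query executor when any operator fails."""


class ScanOperator:
    """Hypothetical operator with no log-and-reraise block of its own."""

    def __init__(self, rows: List[int]) -> None:
        self.rows = rows

    def exec(self) -> Iterator[int]:
        for row in self.rows:
            # Internal invariant, checked with a plain assert.
            assert row >= 0, f"negative row id: {row}"
            yield row


def execute_plan(operator: ScanOperator) -> List[int]:
    """The one place where operator failures become ExecutorError."""
    try:
        return list(operator.exec())
    except Exception as e:
        # AssertionError is a subclass of Exception, so it is caught here too.
        raise ExecutorError(str(e)) from e
```

With this shape, each operator loses its own except branch, which is usually the part that is hard to cover in tests. One caveat: assert statements are skipped when Python runs with -O.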

gaurav274 · 2023-03-21T07:35:31Z

eva/expression/comparison_expression.py

+            rbatch
+        ), f"Left and Right batch does not have equal elements: left: {len(lbatch)} right: {len(rbatch)}"
+
+        assert self.etype in [


Won't keeping it in the else clause be easier to maintain? It would be similar to the default case of a switch statement in C++. Future changes won't need to worry about modifying the assert. What do you think?

Good idea. But, it will be harder to hit the else without a synthetic test case (from a coverage standpoint).

But if a new expression type is added and the assert is not updated, the test case will fail until it is. So, they will be forced to update the assert.

How about we move the assert condition to the test case? Any update to the logic will fail the test case and will require updating it.
It is a one-time effort either way: we do it in the code using an assert or in the test. I prefer the test from a readability perspective. What do you think?

I am not entirely sure if I understand. Can you take care of this change?
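One possible reading of the suggestion above, sketched under stated assumptions: the supported comparison types live in a dispatch table, and a test checks that the table covers every comparison member of the enum. The enum members, `COMPARISON_OPS`, and `evaluate` are hypothetical and are not the actual contents of comparison_expression.py.

```python
import operator
from enum import Enum, auto


class ExpressionType(Enum):
    # Hypothetical subset of expression types, for illustration only.
    COMPARE_EQUAL = auto()
    COMPARE_GREATER = auto()
    COMPARE_LESSER = auto()
    LOGICAL_AND = auto()


# Dispatch table instead of an if/elif chain with a trailing else or assert.
COMPARISON_OPS = {
    ExpressionType.COMPARE_EQUAL: operator.eq,
    ExpressionType.COMPARE_GREATER: operator.gt,
    ExpressionType.COMPARE_LESSER: operator.lt,
}


def evaluate(etype: ExpressionType, left: int, right: int) -> bool:
    return COMPARISON_OPS[etype](left, right)


def test_every_comparison_type_is_handled() -> None:
    # Adding a new COMPARE_* member without a handler fails this test.
    comparison_types = {
        t for t in ExpressionType if t.name.startswith("COMPARE_")
    }
    assert comparison_types == set(COMPARISON_OPS)
```

The production code then has no branch that needs a synthetic test case to reach, and the exhaustiveness check still fails as soon as a new comparison type is added without a handler.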

xzdandy · 2023-03-22T04:55:00Z

eva/binder/statement_binder.py

As in eva/catalog/catalog_manager.py, shall we also replace logger.exception and raise BinderError with assert for consistency?

xzdandy · 2023-03-22T05:00:52Z

eva/executor/delete_executor.py

-        except Exception as e:
-            logger.error(e)
-            raise ExecutorError(e)
+        if table_catalog.table_type == TableType.STRUCTURED_DATA:


We do not need this if check. The previous assert ensures that table_catalog.table_type is TableType.STRUCTURED_DATA.

Yes, resolved.

xzdandy · 2023-03-22T05:10:56Z

eva/executor/lateral_join_executor.py

-            raise ExecutorError(e)
-
-    def __call__(self, *args, **kwargs) -> Generator[Batch, None, None]:
-        yield from self.exec(*args, **kwargs)


Does Ray support lateral join now? I think __call__ is needed for Ray.

Not sure about this. cc @kaushikravichandran @jiashenC @gaurav274

Even if that were the case, this code was not exercised by any of the test cases and was repeated in a few operators.

Ideally, I think it could be triggered by Ray if there is a LateralJoin under a Projection. Maybe we can move this to the AbstractExecutor to improve the coverage? The implementation would be the same for every executor anyway.
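A rough sketch of that suggestion, assuming a simplified executor hierarchy. The real executors yield Batch objects; a plain list of ints stands in for Batch here, and the class bodies are hypothetical. Defining `__call__` once on the base class means each concrete executor only implements `exec`.

```python
from abc import ABC, abstractmethod
from typing import Generator, List


class AbstractExecutor(ABC):
    @abstractmethod
    def exec(self, *args, **kwargs) -> Generator[List[int], None, None]:
        """Yield result batches; List[int] is a stand-in for Batch."""

    def __call__(self, *args, **kwargs) -> Generator[List[int], None, None]:
        # Shared by every executor, so it only needs to be covered once.
        yield from self.exec(*args, **kwargs)


class LateralJoinExecutor(AbstractExecutor):
    def exec(self, *args, **kwargs) -> Generator[List[int], None, None]:
        # Placeholder output; the real join logic is omitted.
        yield [1, 2]
        yield [3]
```

Calling an instance, as in `list(LateralJoinExecutor()())`, goes through the inherited `__call__` and delegates to `exec`.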

xzdandy · 2023-03-22T05:13:34Z

eva/executor/load_multimedia_executor.py

-                    raise ExecutorError(
-                        f"StorageEngine {storage_engine} create call failed"
-                    )
+                storage_engine.create(table_obj)


Do we still need to check whether create succeeded here? We probably don't want the subsequent write to run if create failed.

Changing this logic breaks these two test cases -- test_should_fail_to_load_videos_with_same_path and test_should_fail_to_load_images_with_same_path

    storage_engine = StorageEngine.factory(table_obj)
    if do_create:
        create_status = storage_engine.create(table_obj)
        if create_status:
            storage_engine.write(
                table_obj,
                Batch(pd.DataFrame({"file_path": valid_files})),
            )

cc @gaurav274

It is fine. create will raise an error.
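To illustrate why no status check is needed when create raises on failure, here is a small sketch with a hypothetical in-memory engine. `InMemoryStorageEngine`, `TableExistsError`, and `load` are illustrative names and do not reflect EVA's StorageEngine API.

```python
from typing import Dict, List


class TableExistsError(Exception):
    """Hypothetical error raised when create fails."""


class InMemoryStorageEngine:
    def __init__(self) -> None:
        self.tables: Dict[str, List[str]] = {}

    def create(self, name: str) -> None:
        # Signals failure by raising instead of returning a status flag.
        if name in self.tables:
            raise TableExistsError(f"table {name} already exists")
        self.tables[name] = []

    def write(self, name: str, rows: List[str]) -> None:
        self.tables[name].extend(rows)


def load(
    engine: InMemoryStorageEngine,
    name: str,
    files: List[str],
    do_create: bool,
) -> None:
    if do_create:
        engine.create(name)  # if this raises, the write below never runs
    engine.write(name, files)
```

Because the exception propagates out of `load`, a failed create can never be followed by a write, and there is no status flag for a caller to forget to check.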

xzdandy · 2023-03-22T05:22:54Z

eva/expression/abstract_expression.py

-            setattr(result, k, deepcopy(v, memo))
-        return result
-
-    def copy(self):


Is copy not needed anymore?

It was not being used.

xzdandy · 2023-03-22T05:26:16Z

eva/models/storage/batch.py

+        for column in by:
+            if column not in self._frames.columns:
+                raise KeyError(
+                    "Can not orderby non-projected column: {}".format(column)


Can we also remove this raise?

jiashenC · 2023-03-23T01:24:40Z

eva/binder/statement_binder.py

-                        raise BinderError("Index input needs to be float32.")
-                    if not len(output.array_dimensions) == 2:
-                        raise BinderError("Index input needs to be 2 dimensional.")
+        assert IndexType.is_faiss_index_type(


Just curious: if we later support index types other than Faiss, we would still need an if/else here, right (assuming we have some index-specific checks)?
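If index-specific checks are needed later, one option is a per-type validator table rather than a growing if/else chain. This is only a sketch: the enum members, `check_faiss_input`, `check_other_input`, and `validate_index_input` are all hypothetical, and only the two Faiss messages come from the diff above.

```python
from enum import Enum, auto
from typing import Callable, Dict


class IndexType(Enum):
    # Hypothetical members; OTHER stands for some future non-Faiss index.
    FAISS = auto()
    OTHER = auto()


def check_faiss_input(dtype: str, ndim: int) -> None:
    assert dtype == "float32", "Index input needs to be float32."
    assert ndim == 2, "Index input needs to be 2 dimensional."


def check_other_input(dtype: str, ndim: int) -> None:
    # Placeholder rule for the hypothetical second index type.
    assert ndim >= 1, "Index input needs at least one dimension."


VALIDATORS: Dict[IndexType, Callable[[str, int], None]] = {
    IndexType.FAISS: check_faiss_input,
    IndexType.OTHER: check_other_input,
}


def validate_index_input(index_type: IndexType, dtype: str, ndim: int) -> None:
    # One lookup replaces the if/else chain over index types.
    VALIDATORS[index_type](dtype, ndim)
```

Each validator can then be unit-tested on its own, and adding an index type means adding one table entry.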

gaurav274 · 2023-03-23T03:43:37Z

This PR affects Ray. However, in my testing, Ray does not work on master anyway, so I plan to take care of it in another PR.

aryan-rajoria and others added 30 commits February 9, 2023 19:21

adding delete operation

c9afc12

Adding Insert Statement

34dfbf7

checkpoint

9fa9857

supporting multiple entries

ebe26d3

implemented for structured data error

bc722dd

adding parser visitor for delete

0e29858

delete executor

c1a7864

delete plan and rules

5ac631a

adding delete to plan executor

3238e95

change position of LogicalDelete

02a1d28

logical delimeter

01181b5

delete test case

562a7ca

adding test case

9887732

adding test case

d2a1a3d

adding delete testcase

f09c613

adding predicate to delete executor

79a6168

adding delete to Image storage

5ce1991

bug fix in delete

91d7b06

fixing testcase

0aac934

adding test case for insert statement

ee48803

remove order_by from statement_binder.py

fc2f243

better variable names, using Batch

343a4a2

error message for insert

121451f

removing order_by and limit from delete

5b47c15

remove order_by and limit

8c75a5e

use f-string

6772cd0

adding to changelog

7a10d67

removing commit messages

1a4204f

formatting

e96d3a4

fixing comments

640e7ed

jarulraj added 5 commits March 21, 2023 02:00

checkpoint

508e4a7

checkpoint

31dcd9a

checkpoint

7ac35d9

checkpoint

53c3fec

checkpoint

356310b

gaurav274 reviewed Mar 21, 2023


jarulraj added 7 commits March 21, 2023 03:45

checkpoint

c5b6c77

checkpoint

03c9b50

checkpoint

212e001

checkpoint

1cda24c

checkpoint

bbff19b

checkpoint

c104ec3

try to run tests in parallel

36d1946

xzdandy reviewed Mar 22, 2023


jarulraj added 4 commits March 22, 2023 01:38

checkpoint

f15e0a3

checkpoint

bfb344d

checkpoint

7d89836

checkpoint

b586931

jiashenC reviewed Mar 23, 2023


gaurav274 added 2 commits March 22, 2023 23:02

minor fix for ray to work

8f43895

ray fixes

a57f720

gaurav274 merged commit 77c07d7 into master Mar 23, 2023

gaurav274 deleted the coverage branch March 23, 2023 04:13

jarulraj mentioned this pull request Apr 3, 2023

[RELEASE]: v0.1.5 #629

Merged
