Add dp3 paper to version control #19

Merged
merged 1 commit into from
May 3, 2024

@wkalt (Owner) commented Apr 30, 2024

No description provided.

@wkalt force-pushed the task/add-paper branch 14 times, most recently from 12a0401 to 0fb682f (May 3, 2024 03:01)
allowing the user to quickly pivot between searching for interesting events,
and replaying those events in full context.

For the ML developer, dp3 supports both client/server and embedded modes of

These aren't defined clearly IMO - is there a way to put this that does not introduce new terms?

Owner Author

this is still outstanding

@wkalt force-pushed the task/add-paper branch 3 times, most recently from 3acc1e4 to fbe8322 (May 3, 2024 04:31)
all apply to server logs as well. Many organizations employ more infrastructure
engineers than all the employees in the largest robotics company, and some of
them undoubtedly produce petabytes of logs and analyze them with Kafka
\cite{kafka} or Spark \cite{spark}.
Collaborator

Not sure the above paragraph is needed. Can't we move on to the next paragraph directly?

traditional log databases are generally driven by organizational scale – i.e.,
how many thousands of internal microservices there are. Organizational scale is
slower-growing because it usually requires some form of sustained business
success over years.
Collaborator

I feel that there are some untestable hypotheses in here. Could we shorten this to simply state the challenge(s) brought on by large scales of multimodal log data?

serve for ML purposes, will be image and point-cloud data. In these kinds of
messages, there is effectively one column: a giant byte array. This means that
row-oriented and columnar files for these kinds of data are physically pretty
similar. So intuitively, it seems like the performance benefits we would hope
Collaborator

@jainilajmera commented May 3, 2024

"Intuitively," "it seems," "we would hope," "mostly," "perhaps less," etc. are subjective claims that are especially weak when paired together (as in this sentence). Can we omit or more clearly describe/quantify these?

extremely complex. Although one workable option is to hire SQL experts to
interface between the database and the robotics developer, one of dp3's
experimental goals is to avoid this solution and attempt instead to empower
the robotics developer directly. Second, analytics databases are not
Collaborator

If this is one of the motivations of dp3, can we state it earlier in the paper? This project has lots of potential, but I feel that the motivations are a bit scattered.

user to decide whether to associate producers with \q{devices} or
\q{simulation runs} or another internal concept. dp3 supports multiple
logical databases within a single instance, and operations must be scoped
with a database.
Collaborator

"with" --> was "within" intended here?

only with producers and topics. Trees are constructed based on a start and end
time, branching factor, and target leaf width in seconds. The constructor then
picks a height that results in a tree satisfying the dimensional constraints.
The height obtained by a default tree is 5.
Collaborator

@jainilajmera commented May 3, 2024

"obtained by" --> "of"?
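
For readers of this thread, the height selection described in the quoted paragraph can be sketched roughly as below. This is only an illustration under stated assumptions, not dp3's actual constructor: the function name `tree_height`, its parameter names, and the convention that a tree of height h has `branching_factor ** h` leaves are all hypothetical. The sketch picks the smallest height whose leaves, each `leaf_width_secs` wide, cover the requested time range.

```python
def tree_height(start: int, end: int, branching_factor: int, leaf_width_secs: int) -> int:
    """Return the smallest height h such that branching_factor**h leaves,
    each leaf_width_secs wide, cover the range [start, end) in seconds.

    Hypothetical helper: the name, signature, and height convention are
    assumptions made for illustration, not taken from dp3.
    """
    if end <= start:
        raise ValueError("end must be greater than start")
    if branching_factor < 2:
        raise ValueError("branching_factor must be at least 2")
    if leaf_width_secs < 1:
        raise ValueError("leaf_width_secs must be at least 1")

    span = end - start
    height = 0
    leaves = 1  # a tree of height 0 is a single leaf
    # Add levels until the leaves collectively span the requested range.
    while leaves * leaf_width_secs < span:
        leaves *= branching_factor
        height += 1
    return height
```

Under these assumptions the height grows logarithmically with the covered time span, so even a very wide range yields a shallow tree when the branching factor is large.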

incomparable. However, support for this may become important as we develop
more focus on subqueries.

Continuous queries are also not supported in dp3 today, however support
Collaborator

--> "; however, support would not require much work."
Should we mention this in our feature development plans if not mentioned already?

This adds the dp3 paper to version control, along with a job to ensure
the pdf matches the tex file.
@wkalt force-pushed the task/add-paper branch from fbe8322 to 93d455d (May 3, 2024 17:36)
@wkalt merged commit 21ecb9c into main on May 3, 2024
3 checks passed
3 participants