-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add dp3 paper to version control #19
Conversation
12a0401
to
0fb682f
Compare
doc/paper/paper.tex
Outdated
allowing the user to quickly pivot between searching for interesting events, | ||
and replaying those events in full context. | ||
|
||
For the ML developer, dp3 supports both client/server and embedded modes of |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
These aren't defined clearly IMO - is there a way to put this that does not introduce new terms?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this is still outstanding
3acc1e4
to
fbe8322
Compare
doc/paper/paper.tex
Outdated
all apply to server logs as well. Many organizations employ more infrastructure | ||
engineers than all the employees in the largest robotics company, and some of | ||
them undoubtedly produce petabytes of logs and analyze them with Kafka | ||
\cite{kafka} or Spark \cite{spark}. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not sure the above paragraph is needed. Can't we move on to the next paragraph directly?
doc/paper/paper.tex
Outdated
traditional log databases are generally driven by organizational scale – i.e | ||
how many thousands of internal microservices there are. Organizational scale is | ||
slower-growing because it usually requires some form of sustained business | ||
success over years. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I feel that there are some untestable hypotheses in here. Could we shorten this to simply state the challenge(s) brought on by large scales of multimodal log data
doc/paper/paper.tex
Outdated
serve for ML purposes, will be image and point-cloud data. In these kinds of | ||
messages, there is effectively one column: a giant byte array. This means that | ||
row-oriented and columnar files for these kinds of data are physically pretty | ||
similar. So intuitively, it seems like the performance benefits we would hope |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"Intuitively," "it seems," "we would hope," "mostly," "perhaps less," etc. are subjective claims that are especially weak when paired together (as in this sentence). Can we omit or more clearly describe/quantify these?
extremely complex. Although one workable option is to hire SQL experts to | ||
interface between the database and the robotics developer, one of dp3's | ||
experimental goals is to avoid this solution and attempt instead to empower | ||
the robotics developer directly. Second, analytics databases are not |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If this is one of the motivations of dp3, can we indicate this somewhere previously? This project has lots of potential, but I feel that the motivations are a bit scattered in the paper
user to decide whether to associate producers with \q{devices} or | ||
\q{simulation runs} or another internal concept. dp3 supports multiple | ||
logical databases within a single instance, and operations must be scoped | ||
with a database. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"with" --> was "within" intended here?
only with producers and topics. Trees are constructed based on a start and end | ||
time, branching factor, and target leaf width in seconds. The constructor then | ||
picks a height that results in a tree satisfying the dimensional constraints. | ||
The height obtained by a default tree is 5. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"obtained by" --> "of"?
incomparable. However, support for this may become important as we develop | ||
more focus on subqueries. | ||
|
||
Continuous queries are also not supported in dp3 today, however support |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
--> "; however, support would not require much work."
Should we mention this in our feature development plans if not mentioned already?
This adds the dp3 paper to version control, along with a job to ensure the pdf matches the tex file.
No description provided.