-
Notifications
You must be signed in to change notification settings - Fork 33
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DataFrame commands throwing java.lang.NullPointerException on example data #320
Comments
Works when running natively with
but fails when running with
Apologies, this probably belongs in the docker repo. |
Works if we read in a directory, i.e. import io.archivesunleashed._
import io.archivesunleashed.df._
val df = RecordLoader.loadArchives("/data/*", sc)
.extractValidPagesDF()
df.printSchema() |
So, is it just a documentation issue on archivesunleashed.org/aut? |
No, it can't read the i.e. this doesn't work scala> :paste
// Entering paste mode (ctrl-D to finish)
import io.archivesunleashed._
import io.archivesunleashed.df._
val df = RecordLoader.loadArchives("*.gz", sc)
.extractValidPagesDF()
df.printSchema() Or we can just say not to use it with Docker? |
I can't reproduce it: Standalone:
Docker:
I'm certain it is a documentation issue, or a misreading of it. There is no
All of the documentation here uses |
🤦♂ Oh, of course. I'll close this with egg on my face. Sorry @ruebot. |
No worries! :-D |
Right now on 0.17.0, using Docker, running any DataFrame command leads to a
java.lang.NullPointerException
error.For example,
leads to
We should try to get it so that on Docker the DataFrame commands work out of the box (which they did before, I think..).
The text was updated successfully, but these errors were encountered: