-
Notifications
You must be signed in to change notification settings - Fork 326
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
gluon-core: babel: broken DNS #1500
Comments
Due to the revert batman-based nodes are unaffected, babel-nodes with ipv6 on their wan-ports may be affected: Besides this hack in commit b3d7011 we could consider adding src stanzas to all the babel-routes. I am working on a patch for this. |
A babel patch exists now. This issue will be fixed by using a babel version based on https://github.com/christf/babeld/tree/PREFSRC and an appropriate babeld configuration.
|
@christf maybe you can post a short update on your progress and plans, i think we should talk about removing the feature again if there's no chance to get it fixed completely soon? |
Following a conversation on the babeld mailing list a patch was prepared by myself that is ready and working. A PR is raised upstream in babeld - see here: jech/babeld#18. Unfortunately it is not merged yet. The only symptom by this feature not being merged is that on multi-homed nodes (having ipv6 on WAN) the kernel decides which outbound IP address is used when doiing requests sometimes causing DNS leaks. imho not a big deal - still it is nice to get this fixed which will happen after prefsrc is merged. in babeld there are tests going on for preparing a merge of the unicast branch on which this is all based on. There is no need to revert anything in gluon. |
ok, thanks for the update! i removed the "regression" label, as nothing else beside the new feature is broken anymore. i added a release blocker label instead. |
as far as i see it, the fix has been pushed to master branch of babeld |
The fix was pushed to babeld and the required configuration has been tested for two months now. Waiting for modules update to openwrt master where babeld 1.9.1, containing the patch, has just been included. |
@christf are we now on a openwrt master status which includes this? what does this mean, there is no issue anymore? and what about the initial change which wanted to make DNS requests to leave via specific interfaces? |
@christf @NeoRaider ping? |
The original change was broken (the 'loopback' interface doesn't even have the primary node address in non-Babel setups), but it is also too specific - all protocols need to use the correct interfaces, DNS is not special in any way. |
@NeoRaider so "someone" needs to work this out "sometime" ? 🗡️ @christf are we now on a openwrt master status which includes this? what does this mean, there is no issue anymore? |
tfor babeld we now have the ability to specify the correct src-address. This was merged in november with #1877. This leads to ipv6 traffic leaving the correct interface without other hacks. |
if you think this affects batman too, do we need a new issue? what exactly is the issue? do we need something like b3d7011 for batman? @NeoRaider |
as @christf and i just weren't able to reproduce a problem with the src address running Gluon v2019.1.1 with batman-adv, maybe there is no issue left. waiting for @NeoRaider opinion though... The x86 node we tested with had a public IPv6 address on both br-client and br-wan and two default routes similar to this:
The node also had IPv6 nameservers for both br-wan and br-client. |
@rotanid Are these default routes in different routing tables? The WAN default route should never appear in the main table. I'm aware of only one serious issue regarding source address / interface selection in current Gluon (with batman-adv, but likely also affects babel): #1132. Is there still anything to fix for babel? |
@NeoRaider i did not query any routing table on purpose, afair i only did regarding babel, as far as i understand the comment from @christf it's only about batman now |
@rotanid Hmm, I think that means that the br-wan default route was in the default table as well, weird - but I don't fully trust the busybox ip, so I might be wrong. If things are working as intended, only the mesh routes should end up in the main table ( |
@NeoRaider with busybox ip there are 27 routes. all in the table 254 output, none in table 1 output. any other test cases to be able to say this issue does not exist with batman-adv setups? |
Okay, I think this can be closed (and indeed Busybox ip is useless when it comes to multiple routing tables...). The remaining weirdness in our routing setup is tracked in #1132. |
On-node DNS resolving is broken as of b3d7011 - quick fix applied by reverting said commit in 78ed75e
Solution by @christf is pending.
The text was updated successfully, but these errors were encountered: