-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Post-treatments and warning messages #4
Comments
I do not have an intuition for this warning but your questions are very relevant. Could you please provide:
Also, the robustness of your network is linked to the stability of the edges. Could you please provide the stability curve for the stability selection step ? (plot given by the |
**"site" **is the environment covariance matrix, with 8 obs. of 8 variables, variables including disturbs such as human disturb, temperature etc.
PLNMODEL<- PLN(data1~site$HU)
|
It is possible the warning is a result of your very few number of observations (rows).
|
Thank you Momal, I will try, thank you very much! |
Hi sorry for my late answer. But the problem is that we don't known how to set Pt a priori because they are probabilities scaled for a tree structure (the sum of all probabilities is the number of edges in a tree graph), they are not really edges probabilities of presence. This is where StATS() helps: it provides the optimal Pt, which is the value for which we have a good stability value across resamples, which means a good agreement between the resamples on the selected edges. Additionally to providing a solution to the choice of Pt, you see here that we change of quantity of interest, going from conditional probabilities to well-understandable selection frequencies. This is important for interpretation : nos the weight on edges is actually a measure of the robustness of edges. So after ResampleEMtree(), use StATS() to find the optimal Pt, and then you can either threshold the edge selection frequencies if you wish a binary network, or juste continue with the edges frequencies to work with a weighted network. Is that clearer? |
Yes!!! Never understand a model so clear before!! Thank you so so so so so much Momal!! |
Hi, Momal:
data
Can I just ignore these warnings? I try each year-season data, on the one hand, I exclude species data with too many "0"; on the other hand, some year-season has many plots like 37. But all have the warnings. Looking forward to your reply! |
Hi, sorry again for the late answer. Best, |
Thank you so much for the reply! It help me to continue my work! The data dimension 23 x 10 is the species I selected more than 3 times in the matrix, and the network I showed is all the species including some species which may only occur once (I try whether there is difference with or without these species, and the PLN result is no). I think the network is more complete to show with 25 species to you, so I use this network. Sorry the inconsistency before and after confuses you. |
Hi Momal:
I set different number of maxlter and get different network interactions. So I am curious how to set a maxIter. I have 8-year data, I want to show network each year. And the species I have most is 21 in 2020. So I think maxIter is 21 in all networks, so that all (8) networks have the uniform standards. Do I understand correctly? Looking forward to your reply! Best wishes! |
Hi Ying,
- maxIter is the "Maximum number of EMtree iterations at each
sub-sampling". It controls the number of iterations of the algorithm, so
setting it to 21 can be too low and stopping the algorithm before it gets a
chance to reach convergence. Il would be better to set this parameter to at
least 50.
- Your S parameter is very high, not that it is a problem but I think it is
unnecessary. S is the "Total number of sub-samples.", and as you have 21
samples maximum setting S to 500 is enough.
- cond.tol should be very low as it is a precision threshold for the
computations. Default value might be too low as you have little number of
samples, but I do not recommend to set it above 1e-6. Setting this higher
creates a risk that the algorithm does not converge properly.
You see that it is mostly about the algorithm's convergence, that you can
check by running the simple EMtree() function with plot=TRUE (as a first
check for maxiter, before ResampleEMtree() for example.
Best,
Raphaëlle
Le mar. 31 mai 2022 à 04:34, GENG Ying ***@***.***> a écrit :
… Hi Momal:
Sorry I still have questions about EMtree.
ResampEmtreeFit<-ResampleEMtree(counts=data, S=999, covar_matrix = site$HL, maxIter=25,cond.tol=1e-8, cores=1)
I set different number of *maxlter* and get different network
interactions. So I am curious how to set a *maxIter*. I have 8-year data,
I want to show network each year. And the species I have most is 21 in
2020. So I think *maxIter* is *21* in all networks, so that all (8)
networks have the uniform standards. Do I understand correctly?
And I have no idea how to set *cond.tol*, is this influence my result?
Looking forward to your reply!
Best wishes!
Geng Ying.
—
Reply to this email directly, view it on GitHub
<#4 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/AH2P2ZIUMZZRL756X4YH7G3VMV3EJANCNFSM5N4GSVIQ>
.
You are receiving this because you are subscribed to this thread.Message
ID: ***@***.***>
|
Hi Momal: Best wishes! |
"data" is the species abundance matrix
##"site" is the environmant covariance matrix
##HU is a factor in site
fit PLN model
Follow is my questions:
The text was updated successfully, but these errors were encountered: