# Black Holes and Revelations ### Using software group #### How do we create a raster of uncertainty from the probabilities of interactions and occurrence? * The probability of interaction around 0.5 if the most uncerain - we should somehow point users to go sample in the locations with this probability. * We are trying to find the locations with the highest probability of co-occurrence and the with the probability of interaction closer to 0.5. * Similar environments should result in the same probability of interactions. * We want to exclude locations with super high or super low probability of interactions before we proceed. > *:warning: Our uncertainty about an interaction in a given location is influenced by the probability of interaction in the neighbour cells.* * How do we measure the distance to the most uncertain interaction and most likely co-occurrence * Euclidean distance? But this gives weird situations depending on which side the distance comes from * Weights would be a better alternative than the Euclidean distance * If we want to use the full probability distribution for the interaction in the metaweb, we will need to use a double integral * How do we optimize this distance * Minimize something like $W_1 * (P(A \rightarrow B) - 0.5)^2 + W_2 * (P(A)P(B) - 0.5)^2$ * we've got a _curiosity score_ which shows us where we should go sampling when its value is high. ## Code notes * Network table has less lines than expected because forbidden links were excluded * coocurrence vector is: * each column is a site * each row is a pair of species #### TODO * Add 0s to diagonal of cooccurrence layer ## GH notes * cooccurrence branch - devs finalized cooccurrence matrix, excluding forbidden links ## Glossary Please add terms / edit definitions as you see fit ---WIP **INTERACTION**: species A eats species B (consumer-resource interaction) **NETWORK**: Structured set of interactions; set of vertices $V$, function $f:(V * Y) -> {0,1}$ / web showing all possible species interactions present. **GLOBAL METAWEB**: Network of all potential interactions within total species pool. **LOCAL METAWEB**: subset of potential species interactions from the global metaweb for species present at a specific site. **PROBABILITY**: Degree of belief; measurement of the frequency of observations in a 🌟 universe of possibilities 🌟. **SAMPLING**: point/area in which to collect data on species occurrence and interactions. **UNCERTAINTY**: Degree of belief that two species interact and co-occur at a spatial location **ENTROPY**: Amount of information provided by _new_ data. **SPECIES DISTRIBUTION MODEL (SDM)**: uses environmental data to statistically infer the spatial distribution of a species population **PROBABILITY OF CO-OCCURANCE** $P(A)P(B)$: likelihood that species A and species B are present at a sampling location **PROBABILITY OF INTERACTION** $P(A→B)$ : likelihood that species A eats species B **CURIOSITY**: value given to a sampling location, based on the degree of proposed sampling effort and the rarity of interactions being searched for **API**: Any function you call but didn't write (Application Programatic Interface). **Potential Interaction**: Any interaction with a probability of interaction >0. **Realised Interaction**: Obeserved empirical interaction. **Bias**: Systematically misestimating.