Precision and Recall
21 Dec 2017
Let’s assume that you have a cancer recognition process with a 1 percent error rate, but only 0.5 percent of patients have cancer.
If you always predict $y = 0$, you get a 0.5 percent error, which seems better than the process, even though such a predictor never detects a single case.
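The arithmetic behind this can be checked with a small sketch. The population of 10,000 patients and the helper name `error_rate` are assumptions chosen for illustration; only the 0.5 percent prevalence comes from the text.

```python
def error_rate(y_true, y_pred):
    """Fraction of predictions that differ from the true label."""
    wrong = sum(1 for t, p in zip(y_true, y_pred) if t != p)
    return wrong / len(y_true)


# Hypothetical population: 10,000 patients, 0.5% of whom (50) have cancer.
y_true = [1] * 50 + [0] * 9950

# A "classifier" that ignores its input and always predicts y = 0.
always_zero = [0] * len(y_true)

baseline_error = error_rate(y_true, always_zero)

# Number of cancer patients the always-zero predictor actually finds.
detected = sum(1 for t, p in zip(y_true, always_zero) if t == 1 and p == 1)

print(baseline_error)  # 0.005, i.e. 0.5 percent
print(detected)        # 0
```

The error is low only because the classes are so unbalanced; none of the 50 sick patients is found.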
How do we calculate quality indicators that reflect the true performance?
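Precision: of all patients where we predicted $y = 1$, what fraction actually has cancer? $\text{Precision} = \frac{\text{true positives}}{\text{true positives} + \text{false positives}}$
Recall: of all patients that actually have cancer, what fraction did we correctly detect as having cancer? $\text{Recall} = \frac{\text{true positives}}{\text{true positives} + \text{false negatives}}$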
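These two fractions are usually called precision and recall. A minimal sketch of how they could be computed from 0/1 labels follows; the function name `precision_recall`, the toy data, and the choice to return 0.0 when a denominator is zero are assumptions made for this example.

```python
def precision_recall(y_true, y_pred):
    """Return (precision, recall) for binary 0/1 labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives

    # Assumption: report 0.0 when the fraction is undefined (no predicted
    # positives, or no actual positives).
    precision = tp / (tp + fp) if (tp + fp) > 0 else 0.0
    recall = tp / (tp + fn) if (tp + fn) > 0 else 0.0
    return precision, recall


# Toy data: 4 patients with cancer, 6 without.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 1, 0, 0, 0, 0, 0]

# tp = 2, fp = 1, fn = 2, so precision = 2/3 and recall = 1/2.
precision, recall = precision_recall(y_true, y_pred)
```

With this convention, a predictor that always outputs 0 gets a recall of 0, which exposes the problem that the plain error rate hides.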
Higher precision $\rightarrow$ lower recall
Higher recall $\rightarrow$ lower precision
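One common way this trade-off shows up is through the decision threshold of a classifier that outputs a score. The sketch below assumes such a classifier; the scores, labels, and threshold values are invented for illustration, and the trend is typical rather than guaranteed for every dataset.

```python
def precision_recall_at(scores, labels, threshold):
    """Predict y = 1 when score >= threshold, then return (precision, recall)."""
    preds = [1 if s >= threshold else 0 for s in scores]
    tp = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 1)
    fp = sum(1 for y, p in zip(labels, preds) if y == 0 and p == 1)
    fn = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 0)
    precision = tp / (tp + fp) if (tp + fp) > 0 else 0.0
    recall = tp / (tp + fn) if (tp + fn) > 0 else 0.0
    return precision, recall


# Hypothetical classifier scores and the true labels of 8 patients.
scores = [0.95, 0.90, 0.80, 0.70, 0.60, 0.40, 0.30, 0.20]
labels = [1, 1, 0, 1, 0, 1, 0, 0]

# Strict threshold: few positive predictions, all of them correct.
strict = precision_recall_at(scores, labels, 0.85)   # precision 1, recall 2/4

# Moderate threshold: more cases found, some false alarms.
moderate = precision_recall_at(scores, labels, 0.50)  # precision 3/5, recall 3/4

# Loose threshold: every case found, many false alarms.
loose = precision_recall_at(scores, labels, 0.25)     # precision 4/7, recall 4/4
```

Lowering the threshold here moves precision down from 1 to 4/7 while recall rises from 1/2 to 1.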
A single combined score, the harmonic mean of precision $P$ and recall $R$, is called the $F_1$ score:
$$F_1 = 2 \frac{P R}{P + R}$$
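The $F_1$ score is the harmonic mean of precision $P$ and recall $R$, that is $2PR / (P + R)$. A short sketch of why this is preferred over a plain arithmetic average follows; the function name `f1_score` and the example values are assumptions for illustration.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall; 0.0 if both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# A degenerate classifier that predicts y = 1 for almost everyone:
# recall is perfect, precision is terrible.
p, r = 0.02, 1.0

arithmetic_mean = (p + r) / 2  # 0.51, looks deceptively good
f1 = f1_score(p, r)            # about 0.039, reflects the poor precision

# A balanced classifier scores far better under F1.
balanced_f1 = f1_score(0.5, 0.4)  # about 0.444
```

Because the harmonic mean is pulled toward the smaller of the two values, a classifier cannot obtain a high $F_1$ score by maximizing only one of precision or recall.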