Problems related to statistical evidence in data science

I am working on CSV data from a medical insurance company. I have attached a picture of my data structure here.

I am trying to find out:
1. Do the charges of people who smoke differ significantly from those of people who don't?
2. Is the distribution of BMI the same across women with no children, one child, and two children?

To solve these two problems, should I use hypothesis testing, or can it be done with a graphical representation?

The examiner asked for these two questions to be answered with statistical evidence. I am confused about the term "statistical evidence".

Any help regarding this?

Statistical evidence is some kind of probability or level of confidence that backs up your claim, for example how confident you are that BMI differs between women with one child and women with two children. In practice it means applying hypothesis testing and reporting the result (test statistic and p-value).
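As a rough illustration of what that could look like, here is a minimal sketch using pandas and SciPy. The column names (`smoker`, `charges`, `sex`, `children`, `bmi`), the category labels (`"yes"`/`"no"`, `"female"`), and the file name `insurance.csv` are assumptions on my part, since I can only guess at the data structure in your picture. The choice of tests is also just one reasonable option: Welch's t-test for the smoker question and a one-way ANOVA for the BMI question.

```python
import pandas as pd
from scipy import stats


def compare_charges_by_smoking(df, alpha=0.05):
    """Welch's t-test. H0: mean charges are equal for smokers and non-smokers."""
    # Assumed columns: "smoker" with values "yes"/"no", and numeric "charges".
    smokers = df.loc[df["smoker"] == "yes", "charges"]
    non_smokers = df.loc[df["smoker"] == "no", "charges"]
    # equal_var=False selects Welch's variant (no equal-variance assumption).
    t_stat, p_value = stats.ttest_ind(smokers, non_smokers, equal_var=False)
    return {"t": float(t_stat), "p": float(p_value), "reject_h0": bool(p_value < alpha)}


def compare_bmi_by_children(df, alpha=0.05):
    """One-way ANOVA. H0: mean BMI is equal for women with 0, 1 and 2 children."""
    # Assumed columns: "sex" with value "female", integer "children", numeric "bmi".
    women = df[(df["sex"] == "female") & (df["children"].isin([0, 1, 2]))]
    groups = [g["bmi"].to_numpy() for _, g in women.groupby("children")]
    f_stat, p_value = stats.f_oneway(*groups)
    return {"F": float(f_stat), "p": float(p_value), "reject_h0": bool(p_value < alpha)}


# Hypothetical usage (the file name is an assumption):
# df = pd.read_csv("insurance.csv")
# print(compare_charges_by_smoking(df))
# print(compare_bmi_by_children(df))
```

A p-value below your chosen significance level (0.05 here) is the "statistical evidence" for rejecting the null hypothesis. Note that ANOVA compares group means rather than whole distributions; if BMI does not look roughly normal within the groups, `stats.kruskal` is a rank-based alternative that takes the same group arguments. Plots such as box plots are useful support, but on their own they do not give you a p-value.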

© Copyright 2013-2019 Analytics Vidhya