How to extract total content from a blog and across the whole website using import.io

import.io
data_extraction

#1

Hello,

I am trying to use import.io to extract some data from AV but here is what I am getting:


As can be seen in the red box,only a part of the total content is being captured and also only the first page is being used to capture the data.How do I extract the whole content into a single column and do this for all the pages??
While selecting the content of the columns there is no option to expand on a selection and train it to capture more details.
Can somebody please guide me on this one!!


#2

@shuvayan

I think @chandnijoshi09 and Krishna, our apprentice should be able to help you out - they did a similar exercise last year :smile:

Regards,
Kunal


#3

you can also look at rvest package from R