www.Gov.CN - Article Scrape

Generated by a call to:

Pause                   pause       = Pause.getFSInstance("state.dat");
ScrapedArticleReceiver  receiver    = ScrapedArticleReceiver.saveToFS("articleData/");
StorageWriter           log         = new StorageWriter();

pause.initialize();

// articleURLs parameter was generated in previous step.
ScrapeArticles.download(receiver, articleURLs, ns.articleGetter, true, null, false, pause, log);


NOTE: On a few sub-sections of this news website, there are some articles for which the ArticleGet instance was not able to identify where the actual article body was located on the web page. In these cases, either a 'branched' version of ArticleGet should be provided that can handle the different types of news-article pages, **OR** a separate scrape should be performed for the news sub-sections that have different types of articles. Either approach will suffice.

For the purposes of this use-case demonstration, it seems reasonable to show what 'error cases' look like. As such, no 'branched' ArticleGet instance has been provided. This also keeps the demonstration of the classes ScrapeArticles and ScrapeURLs simple, without moving on to more advanced topics.
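
For readers who want a rough picture of what 'branching' means, the following is a minimal, library-independent sketch in plain Java. It does not use the ArticleGet API; the class BranchedExtractor, its methods, and the second page layout (a td element with class "policy-body") are hypothetical names chosen for illustration only. The idea is simply to try one extraction strategy after another and accept the first that returns a non-null body. The naive string matching shown here would not handle nested tags and stands in for a real HTML search.

```java
import java.util.List;
import java.util.Optional;
import java.util.function.Function;

// Hypothetical illustration of a "branched" extractor; not part of the scraping library.
class BranchedExtractor {

    // Returns the text between the first openTag and the next closeTag,
    // or null when the page does not contain that layout.
    static String between(String page, String openTag, String closeTag) {
        int start = page.indexOf(openTag);
        if (start < 0) return null;
        int bodyStart = start + openTag.length();
        int end = page.indexOf(closeTag, bodyStart);
        if (end < 0) return null;
        return page.substring(bodyStart, end);
    }

    // Strategies are tried in order. The first mirrors the 'div class=article'
    // lookup reported in the log; the second layout is an assumption.
    static final List<Function<String, String>> STRATEGIES = List.of(
        page -> between(page, "<div class=\"article\">", "</div>"),
        page -> between(page, "<td class=\"policy-body\">", "</td>")
    );

    // Applies each strategy and returns the first non-null article body.
    static Optional<String> extract(String page, List<Function<String, String>> strategies) {
        for (Function<String, String> strategy : strategies) {
            String body = strategy.apply(page);
            if (body != null) return Optional.of(body);
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        String news   = "<html><div class=\"article\">news text</div></html>";
        String policy = "<html><td class=\"policy-body\">policy text</td></html>";
        String other  = "<html><p>no recognised layout</p></html>";

        System.out.println(extract(news, STRATEGIES));
        System.out.println(extract(policy, STRATEGIES));
        System.out.println(extract(other, STRATEGIES));
    }
}
```

A page that matches none of the strategies yields an empty Optional, which corresponds to the 'produced null' error cases visible in the log output.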


*****************************************************************************************
*****************************************************************************************
 Downloading Articles
*****************************************************************************************
*****************************************************************************************


Visiting URL: [0000 of 0006, 0000 of 0008]  - http://www.gov.cn/premier/2020-09/25/content_5546989.htm
Available Memory: 29,177,336	Total Memory: 32,571,392
	Page contains (471) HTMLNodes.
	Page <TITLE> element is: 李克强:重点建设好群众身边的健身场地_总理_中国政府网
	Article body contains (248) HTMLNodes.
	Article contains (5) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0000 of 0006, 0001 of 0008]  - http://www.gov.cn/premier/2020-09/24/content_5546669.htm
Available Memory: 28,376,744	Total Memory: 32,571,392
	Page contains (473) HTMLNodes.
	Page <TITLE> element is: 李克强开会部署:进一步提高上市公司质量_总理_中国政府网
	Article body contains (250) HTMLNodes.
	Article contains (5) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0000 of 0006, 0002 of 0008]  - http://www.gov.cn/premier/2020-09/23/content_5546501.htm
Available Memory: 28,341,472	Total Memory: 32,571,392
	Page contains (481) HTMLNodes.
	Page <TITLE> element is: 李克强主持召开国务院常务会议 部署进一步提高上市公司质量 保护投资者权益推动资本市场持续平稳健康发展等_总理_中国政府网
	Article body contains (258) HTMLNodes.
	Article contains (5) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0000 of 0006, 0003 of 0008]  - http://www.gov.cn/premier/2020-09/24/content_5546817.htm
Available Memory: 28,421,152	Total Memory: 32,571,392
	Page contains (462) HTMLNodes.
	Page <TITLE> element is: 李克强对第五届新农民新业态创业创新大会作出重要批示_总理_中国政府网
	Article body contains (239) HTMLNodes.
	Article contains (3) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0000 of 0006, 0004 of 0008]  - http://www.gov.cn/guowuyuan/2020-09/24/content_5546932.htm
Available Memory: 28,444,704	Total Memory: 32,571,392
	Page contains (464) HTMLNodes.
	Page <TITLE> element is: 韩正主持召开推动长三角一体化发展领导小组全体会_国务院副总理韩正_中国政府网
	Article body contains (237) HTMLNodes.
	Article contains (4) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0000 of 0006, 0005 of 0008]  - http://www.gov.cn/guowuyuan/2020-09/24/content_5546708.htm
Available Memory: 28,407,640	Total Memory: 32,571,392
	Page contains (409) HTMLNodes.
	Page <TITLE> element is: 胡春华将主持中国东北地区和俄罗斯远东及贝加尔地区政府间合作委员会第三次会议_国务院副总理胡春华_中国政府网
	Article body contains (182) HTMLNodes.
	Article contains (3) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0000 of 0006, 0006 of 0008]  - http://www.gov.cn/guowuyuan/2020-09/25/content_5546944.htm
Available Memory: 28,323,936	Total Memory: 32,571,392
	Page contains (451) HTMLNodes.
	Page <TITLE> element is: 王勇出席全国煤矿智能化建设现场推进会并讲话_国务委员王勇_中国政府网
	Article body contains (224) HTMLNodes.
	Article contains (3) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0000 of 0006, 0007 of 0008]  - http://www.gov.cn/guowuyuan/2020-09/24/content_5546941.htm
Available Memory: 28,474,800	Total Memory: 32,571,392
	Page contains (409) HTMLNodes.
	Page <TITLE> element is: 王毅出席亚信成员国外长视频特别会议_国务委员王毅_中国政府网
	Article body contains (182) HTMLNodes.
	Article contains (3) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0001 of 0006, 0000 of 0011]  - http://www.gov.cn/zhengce/content/2020-09/25/content_5547095.htm
Available Memory: 28,445,744	Total Memory: 32,571,392
	Page contains (897) HTMLNodes.
	Page <TITLE> element is: 国务院办公厅转发国家发展改革委关于促进特色小镇规范健康发展意见的通知(国办发〔2020〕33号)_政府信息公开专栏
	ArticleGet.apply(...) failed: The article or page whose content-body you have tried to retrieve using your ArticleGet produced null.
Factory-generated ArticleGet had called: InnerTagGetInclusive.first(page, "div", "class", TextComparitor.CONTAINS_CSS_CLASS_OR, "article"))
Exception Cause Chain:
THROWN: Torello.HTML.Tools.NewsSite.ArticleGetException
throwable.getLocalizedMessage():	[The article or page whose content-body you have tried to retrieve using your ArticleGet produced null.
Factory-generated ArticleGet had called: InnerTagGetInclusive.first(page, "div", "class", TextComparitor.CONTAINS_CSS_CLASS_OR, "article"))]
throwable.getMessage():		[The article or page whose content-body you have tried to retrieve using your ArticleGet produced null.
Factory-generated ArticleGet had called: InnerTagGetInclusive.first(page, "div", "class", TextComparitor.CONTAINS_CSS_CLASS_OR, "article"))]
0229: Torello.HTML.Tools.NewsSite.ArticleGet.lambda$usual$5c6ebc7$1()
0255: Torello.HTML.Tools.NewsSite.ScrapeArticles.download()
0054: CNB.main()


Visiting URL: [0001 of 0006, 0001 of 0011]  - http://www.gov.cn/zhengce/content/2020-09/23/content_5546373.htm
Available Memory: 28,137,608	Total Memory: 32,571,392
	Page contains (898) HTMLNodes.
	Page <TITLE> element is: 国务院办公厅关于加快医学教育创新发展的指导意见(国办发〔2020〕34号)_政府信息公开专栏
	ArticleGet.apply(...) failed: The article or page whose content-body you have tried to retrieve using your ArticleGet produced null.
Factory-generated ArticleGet had called: InnerTagGetInclusive.first(page, "div", "class", TextComparitor.CONTAINS_CSS_CLASS_OR, "article"))
Exception Cause Chain:
THROWN: Torello.HTML.Tools.NewsSite.ArticleGetException
throwable.getLocalizedMessage():	[The article or page whose content-body you have tried to retrieve using your ArticleGet produced null.
Factory-generated ArticleGet had called: InnerTagGetInclusive.first(page, "div", "class", TextComparitor.CONTAINS_CSS_CLASS_OR, "article"))]
throwable.getMessage():		[The article or page whose content-body you have tried to retrieve using your ArticleGet produced null.
Factory-generated ArticleGet had called: InnerTagGetInclusive.first(page, "div", "class", TextComparitor.CONTAINS_CSS_CLASS_OR, "article"))]
0229: Torello.HTML.Tools.NewsSite.ArticleGet.lambda$usual$5c6ebc7$1()
0255: Torello.HTML.Tools.NewsSite.ScrapeArticles.download()
0054: CNB.main()


Visiting URL: [0001 of 0006, 0002 of 0011]  - http://www.gov.cn/zhengce/content/2020-09/21/content_5545407.htm
Available Memory: 28,124,776	Total Memory: 32,571,392
	Page contains (694) HTMLNodes.
	Page <TITLE> element is: 国务院办公厅关于同意太行山旅游业发展规划(2020—2035年)的函(国办函〔2020〕74号)_政府信息公开专栏
	ArticleGet.apply(...) failed: The article or page whose content-body you have tried to retrieve using your ArticleGet produced null.
Factory-generated ArticleGet had called: InnerTagGetInclusive.first(page, "div", "class", TextComparitor.CONTAINS_CSS_CLASS_OR, "article"))
Exception Cause Chain:
THROWN: Torello.HTML.Tools.NewsSite.ArticleGetException
throwable.getLocalizedMessage():	[The article or page whose content-body you have tried to retrieve using your ArticleGet produced null.
Factory-generated ArticleGet had called: InnerTagGetInclusive.first(page, "div", "class", TextComparitor.CONTAINS_CSS_CLASS_OR, "article"))]
throwable.getMessage():		[The article or page whose content-body you have tried to retrieve using your ArticleGet produced null.
Factory-generated ArticleGet had called: InnerTagGetInclusive.first(page, "div", "class", TextComparitor.CONTAINS_CSS_CLASS_OR, "article"))]
0229: Torello.HTML.Tools.NewsSite.ArticleGet.lambda$usual$5c6ebc7$1()
0255: Torello.HTML.Tools.NewsSite.ScrapeArticles.download()
0054: CNB.main()


Visiting URL: [0001 of 0006, 0003 of 0011]  - http://www.gov.cn/zhengce/content/2020-09/21/content_5545345.htm
Available Memory: 28,191,112	Total Memory: 32,571,392
	Page contains (3906) HTMLNodes.
	Page <TITLE> element is: 国务院关于取消和下放一批行政许可事项的决定(国发〔2020〕13号)_政府信息公开专栏
	ArticleGet.apply(...) failed: The article or page whose content-body you have tried to retrieve using your ArticleGet produced null.
Factory-generated ArticleGet had called: InnerTagGetInclusive.first(page, "div", "class", TextComparitor.CONTAINS_CSS_CLASS_OR, "article"))
Exception Cause Chain:
THROWN: Torello.HTML.Tools.NewsSite.ArticleGetException
throwable.getLocalizedMessage():	[The article or page whose content-body you have tried to retrieve using your ArticleGet produced null.
Factory-generated ArticleGet had called: InnerTagGetInclusive.first(page, "div", "class", TextComparitor.CONTAINS_CSS_CLASS_OR, "article"))]
throwable.getMessage():		[The article or page whose content-body you have tried to retrieve using your ArticleGet produced null.
Factory-generated ArticleGet had called: InnerTagGetInclusive.first(page, "div", "class", TextComparitor.CONTAINS_CSS_CLASS_OR, "article"))]
0229: Torello.HTML.Tools.NewsSite.ArticleGet.lambda$usual$5c6ebc7$1()
0255: Torello.HTML.Tools.NewsSite.ScrapeArticles.download()
0054: CNB.main()


Visiting URL: [0001 of 0006, 0004 of 0011]  - http://www.gov.cn/zhengce/content/2020-09/21/content_5545394.htm
Available Memory: 28,157,592	Total Memory: 32,571,392
	Page contains (1050) HTMLNodes.
	Page <TITLE> element is: 国务院办公厅关于以新业态新模式引领新型消费加快发展的意见(国办发〔2020〕32号)_政府信息公开专栏
	ArticleGet.apply(...) failed: The article or page whose content-body you have tried to retrieve using your ArticleGet produced null.
Factory-generated ArticleGet had called: InnerTagGetInclusive.first(page, "div", "class", TextComparitor.CONTAINS_CSS_CLASS_OR, "article"))
Exception Cause Chain:
THROWN: Torello.HTML.Tools.NewsSite.ArticleGetException
throwable.getLocalizedMessage():	[The article or page whose content-body you have tried to retrieve using your ArticleGet produced null.
Factory-generated ArticleGet had called: InnerTagGetInclusive.first(page, "div", "class", TextComparitor.CONTAINS_CSS_CLASS_OR, "article"))]
throwable.getMessage():		[The article or page whose content-body you have tried to retrieve using your ArticleGet produced null.
Factory-generated ArticleGet had called: InnerTagGetInclusive.first(page, "div", "class", TextComparitor.CONTAINS_CSS_CLASS_OR, "article"))]
0229: Torello.HTML.Tools.NewsSite.ArticleGet.lambda$usual$5c6ebc7$1()
0255: Torello.HTML.Tools.NewsSite.ScrapeArticles.download()
0054: CNB.main()


Visiting URL: [0001 of 0006, 0005 of 0011]  - http://www.gov.cn/zhengce/content/2020-09/21/content_5544926.htm
Available Memory: 28,024,080	Total Memory: 32,571,392
	Page contains (1826) HTMLNodes.
	Page <TITLE> element is: 国务院关于印发北京、湖南、安徽自由贸易试验区总体方案及浙江自由贸易试验区扩展区域方案的通知(国发〔2020〕10号)_政府信息公开专栏
	ArticleGet.apply(...) failed: The article or page whose content-body you have tried to retrieve using your ArticleGet produced null.
Factory-generated ArticleGet had called: InnerTagGetInclusive.first(page, "div", "class", TextComparitor.CONTAINS_CSS_CLASS_OR, "article"))
Exception Cause Chain:
THROWN: Torello.HTML.Tools.NewsSite.ArticleGetException
throwable.getLocalizedMessage():	[The article or page whose content-body you have tried to retrieve using your ArticleGet produced null.
Factory-generated ArticleGet had called: InnerTagGetInclusive.first(page, "div", "class", TextComparitor.CONTAINS_CSS_CLASS_OR, "article"))]
throwable.getMessage():		[The article or page whose content-body you have tried to retrieve using your ArticleGet produced null.
Factory-generated ArticleGet had called: InnerTagGetInclusive.first(page, "div", "class", TextComparitor.CONTAINS_CSS_CLASS_OR, "article"))]
0229: Torello.HTML.Tools.NewsSite.ArticleGet.lambda$usual$5c6ebc7$1()
0255: Torello.HTML.Tools.NewsSite.ScrapeArticles.download()
0054: CNB.main()


Visiting URL: [0001 of 0006, 0006 of 0011]  - http://www.gov.cn/premier/2020-09/17/content_5544243.htm
Available Memory: 27,673,088	Total Memory: 32,571,392
	Page contains (498) HTMLNodes.
	Page <TITLE> element is: 李克强主持召开国务院常务会议 确定政务服务“跨省通办”和提升“获得电力”服务水平的措施等_总理_中国政府网
	Article body contains (275) HTMLNodes.
	Article contains (5) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0001 of 0006, 0007 of 0011]  - http://www.gov.cn/xinwen/2020-08/10/content_5533030.htm
Available Memory: 28,165,072	Total Memory: 32,571,392
	Page contains (467) HTMLNodes.
	Page <TITLE> element is: 秒懂国务院 | “拍了拍”中小企业 国务院这个条例帮你们“催账”_视频新闻_中国政府网
	Article body contains (279) HTMLNodes.
	Article contains (4) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0001 of 0006, 0008 of 0011]  - http://www.gov.cn/xinwen/2020-04/09/content_5500509.htm
Available Memory: 28,149,336	Total Memory: 32,571,392
	Page contains (437) HTMLNodes.
	Page <TITLE> element is: 重点场所、单位、人群怎么防控疫情?这些事一定要注意!_图解图表_中国政府网
	Article body contains (206) HTMLNodes.
	Article contains (12) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0001 of 0006, 0009 of 0011]  - http://www.gov.cn/home/2014-02/18/content_5046260.htm
Available Memory: 28,122,920	Total Memory: 32,571,392
	Page contains (1337) HTMLNodes.
	Page <TITLE> element is: 中国政府网微博、微信_附件_中国政府网
	Article body contains (1139) HTMLNodes.
	Article contains (111) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0001 of 0006, 0010 of 0011]  - http://www.gov.cn/guoqing/2018-06/22/content_5300522.htm
Available Memory: 28,087,648	Total Memory: 32,571,392
	Page contains (326) HTMLNodes.
	Page <TITLE> element is: 国务院研究室_中国概况_中国政府网
	Article body contains (156) HTMLNodes.
	Article content contained 0 HTML IMG elements

Visiting URL: [0002 of 0006, 0000 of 0022]  - http://www.gov.cn/hudong/2020-09/24/content_5546336.htm
Available Memory: 28,187,480	Total Memory: 32,571,392
	Page contains (570) HTMLNodes.
	Page <TITLE> element is: 卫生健康委答网民关于“建议修订粉尘类职业病防治有关规范”的留言_国家卫生健康委员会_中国政府网
	Article body contains (151) HTMLNodes.
	Article contains (2) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0002 of 0006, 0001 of 0022]  - http://www.gov.cn/hudong/2020-09/21/content_5544476.htm
Available Memory: 28,153,536	Total Memory: 32,571,392
	Page contains (567) HTMLNodes.
	Page <TITLE> element is: 市场监管总局答网民关于“希望简化食品经营许可证办理手续”的留言_国家市场监督管理总局_留言回复_我向总理说句话_中国政府网
	Article body contains (148) HTMLNodes.
	Article contains (2) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0002 of 0006, 0002 of 0022]  - http://www.gov.cn/hudong/2020-09/17/content_5544210.htm
Available Memory: 28,118,872	Total Memory: 32,571,392
	Page contains (554) HTMLNodes.
	Page <TITLE> element is: 市场监管总局答网民关于“建议允许小微企业‘居家办公’”的留言_国家市场监督管理总局_中国政府网
	Article body contains (135) HTMLNodes.
	Article contains (2) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0002 of 0006, 0003 of 0022]  - http://www.gov.cn/xinwen/2020-08/12/content_5534321.htm
Available Memory: 28,084,992	Total Memory: 32,571,392
	Page contains (437) HTMLNodes.
	Page <TITLE> element is: 有大事发生,跟你家老房子有关_图解图表_中国政府网
	Article body contains (206) HTMLNodes.
	Article contains (4) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0002 of 0006, 0004 of 0022]  - http://www.gov.cn/xinwen/2020-04/24/content_5505734.htm
Available Memory: 28,180,616	Total Memory: 32,571,392
	Page contains (480) HTMLNodes.
	Page <TITLE> element is: 你有一份来自国务院部门的问卷_滚动新闻_中国政府网
	Article body contains (253) HTMLNodes.
	Article contains (5) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0002 of 0006, 0005 of 0022]  - http://www.gov.cn/xinwen/2020-09/21/content_5545219.htm
Available Memory: 28,153,304	Total Memory: 32,571,392
	Page contains (793) HTMLNodes.
	Page <TITLE> element is: 国务院新闻办就邮政快递业助力脱贫攻坚有关情况举行新闻发布会_新闻发布_中国政府网
	Article body contains (566) HTMLNodes.
	Article contains (6) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0002 of 0006, 0006 of 0022]  - http://www.gov.cn/xinwen/2020-09/16/content_5543820.htm
Available Memory: 28,115,776	Total Memory: 32,571,392
	Page contains (873) HTMLNodes.
	Page <TITLE> element is: 国务院新闻办就中国科学院“率先行动”计划第一阶段实施进展有关情况举行新闻发布会_新闻发布_中国政府网
	Article body contains (646) HTMLNodes.
	Article contains (8) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0002 of 0006, 0007 of 0022]  - http://www.gov.cn/xinwen/2020-08/17/content_5536518.htm
Available Memory: 28,082,224	Total Memory: 32,571,392
	Page contains (497) HTMLNodes.
	Page <TITLE> element is: 张纪南:以决战决胜的姿态抓好人社扶贫工作_滚动新闻_中国政府网
	Article body contains (270) HTMLNodes.
	Article contains (3) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0002 of 0006, 0008 of 0022]  - http://www.gov.cn/xinwen/2020-08/21/content_5536425.htm
Available Memory: 28,167,792	Total Memory: 32,571,392
	Page contains (587) HTMLNodes.
	Page <TITLE> element is: 鄂竟平:坚持人民生命高于一切坚决打赢防汛抗洪硬仗_滚动新闻_中国政府网
	Article body contains (360) HTMLNodes.
	Article contains (3) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0002 of 0006, 0009 of 0022]  - http://www.gov.cn/xinwen/2020-08/21/content_5536445.htm
Available Memory: 28,134,160	Total Memory: 32,571,392
	Page contains (580) HTMLNodes.
	Page <TITLE> element is: 王军:在服务“六稳”“六保”大局中贡献税务力量_滚动新闻_中国政府网
	Article body contains (353) HTMLNodes.
	Article contains (3) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0002 of 0006, 0010 of 0022]  - http://www.gov.cn/hudong/2020-09/11/content_5542556.htm
Available Memory: 28,099,712	Total Memory: 32,571,392
	Page contains (555) HTMLNodes.
	Page <TITLE> element is: 市场监管总局答网民关于“企业办注销,各种清缴手续能否简化”的留言_国家市场监督管理总局_中国政府网
	Article body contains (136) HTMLNodes.
	Article contains (2) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0002 of 0006, 0011 of 0022]  - http://www.gov.cn/hudong/2020-08/24/content_5537017.htm
Available Memory: 28,069,392	Total Memory: 32,571,392
	Page contains (749) HTMLNodes.
	Page <TITLE> element is: 市场监管总局答网民关于“降低产品CCC认证费用”的留言_国家市场监督管理总局_中国政府网
	Article body contains (330) HTMLNodes.
	Article contains (3) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0002 of 0006, 0012 of 0022]  - http://www.gov.cn/xinwen/2020-07/03/content_5523840.htm
Available Memory: 28,152,232	Total Memory: 32,571,392
	Page contains (499) HTMLNodes.
	Page <TITLE> element is: 全国电子无偿献血证上线!与纸质版同等效力_滚动新闻_中国政府网
	Article body contains (272) HTMLNodes.
	Article contains (6) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0002 of 0006, 0013 of 0022]  - http://www.gov.cn/xinwen/2020-05/22/content_5513941.htm
Available Memory: 28,123,760	Total Memory: 32,571,392
	Page contains (404) HTMLNodes.
	Page <TITLE> element is: 《政府工作报告》起草组:收到中国政府网有代表性网民建言近1400条_滚动新闻_中国政府网
	Article body contains (216) HTMLNodes.
	Article contains (5) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0002 of 0006, 0014 of 0022]  - http://www.gov.cn/hudong/ducha/2020-09/24/content_5546828.htm
Available Memory: 28,094,696	Total Memory: 32,571,392
	Page contains (424) HTMLNodes.
	Page <TITLE> element is: 贵州群众反映贵州民族大学成人教育合作办学收取高额技能培训费 贵州省认真组织核查督促有关方面清退违规收费_督查回声_中国政府网
	Article body contains (193) HTMLNodes.
	Article contains (4) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0002 of 0006, 0015 of 0022]  - http://www.gov.cn/hudong/ducha/2020-09/24/content_5546635.htm
Available Memory: 28,067,368	Total Memory: 32,571,392
	Page contains (424) HTMLNodes.
	Page <TITLE> element is: 丰宁群众反映向“云窗办照APP”提交申请后近两个月未审核 河北省政府督促有关单位及时整改_督查回声_中国政府网
	Article body contains (193) HTMLNodes.
	Article contains (4) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0002 of 0006, 0016 of 0022]  - http://www.gov.cn/hudong/ducha/2020-09/24/content_5546632.htm
Available Memory: 28,172,136	Total Memory: 32,571,392
	Page contains (424) HTMLNodes.
	Page <TITLE> element is: 德保群众反映摆地摊被当地城管收取2000元保证金 广西壮族自治区政府迅速组织核查 有关单位立即停止收取并退还已收取保证金_督查回声_中国政府网
	Article body contains (193) HTMLNodes.
	Article contains (4) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0002 of 0006, 0017 of 0022]  - http://www.gov.cn/hudong/ducha/2020-09/17/content_5544175.htm
Available Memory: 28,144,904	Total Memory: 32,571,392
	Page contains (426) HTMLNodes.
	Page <TITLE> element is: 三河群众反映货车通行证办理难以及重复检测负担重 河北省政府迅速组织核查 廊坊市政府督促有关单位立行立改_督查回声_中国政府网
	Article body contains (195) HTMLNodes.
	Article contains (4) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0002 of 0006, 0018 of 0022]  - http://www.gov.cn/hudong/ducha/2020-09/17/content_5544176.htm
Available Memory: 28,114,856	Total Memory: 32,571,392
	Page contains (426) HTMLNodes.
	Page <TITLE> element is: 渭南群众反映当地医保局拖欠民营医院医保报销款 陕西省政府督促有关地方和单位及时复核兑付_督查回声_中国政府网
	Article body contains (195) HTMLNodes.
	Article contains (4) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0002 of 0006, 0019 of 0022]  - http://www.gov.cn/xinwen/2017-02/28/content_5171850.htm
Available Memory: 28,087,544	Total Memory: 32,571,392
	Page contains (378) HTMLNodes.
	Page <TITLE> element is: 我向总理说句话,总理会听到吗?(短视频)_视频新闻_中国政府网
	Article body contains (204) HTMLNodes.
	Article contains (1) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0002 of 0006, 0020 of 0022]  - http://www.gov.cn/xinwen/2018-04/15/content_5282577.htm
Available Memory: 28,128,776	Total Memory: 32,571,392
	Page contains (377) HTMLNodes.
	Page <TITLE> element is: 
百万G流量等你来抢!2018政府工作报告答题专场来了!

_
滚动新闻
_中国政府网
	Article body contains (199) HTMLNodes.
	Article contains (2) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0002 of 0006, 0021 of 0022]  - http://www.gov.cn/xinwen/2017-04/01/content_5182725.htm
Available Memory: 28,109,384	Total Memory: 32,571,392
	Page contains (397) HTMLNodes.
	Page <TITLE> element is: 答题送流量!测一测你所知道的政府工作报告,300M流量等你来拿_滚动新闻_中国政府网
	Article body contains (219) HTMLNodes.
	Article contains (2) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0003 of 0006, 0000 of 0009]  - http://www.gov.cn/fuwu/2020-07/28/content_5530716.htm
Available Memory: 28,086,776	Total Memory: 32,571,392
	Page contains (540) HTMLNodes.
	Page <TITLE> element is: 防汛服务专区上线!第一时间掌握预警信息、实时水情_服务信息_中国政府网
	Article body contains (313) HTMLNodes.
	Article contains (9) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0003 of 0006, 0001 of 0009]  - http://www.gov.cn/fuwu/2016-11/24/content_5136864.htm
Available Memory: 28,056,112	Total Memory: 32,571,392
	Page contains (368) HTMLNodes.
	Page <TITLE> element is: 首次申领居民身份证_办事指南_中国政府网
	Article body contains (192) HTMLNodes.
	Article contains (9) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0003 of 0006, 0002 of 0009]  - http://www.gov.cn/fuwu/2016-11/24/content_5136860.htm
Available Memory: 28,129,280	Total Memory: 32,571,392
	Page contains (352) HTMLNodes.
	Page <TITLE> element is: 更换居民身份证_办事指南_中国政府网
	Article body contains (176) HTMLNodes.
	Article contains (9) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0003 of 0006, 0003 of 0009]  - http://www.gov.cn/fuwu/2016-11/24/content_5136852.htm
Available Memory: 28,109,752	Total Memory: 32,571,392
	Page contains (348) HTMLNodes.
	Page <TITLE> element is: 办理临时身份证_办事指南_中国政府网
	Article body contains (170) HTMLNodes.
	Article contains (9) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0003 of 0006, 0004 of 0009]  - http://www.gov.cn/fuwu/2016-11/24/content_5136857.htm
Available Memory: 28,090,240	Total Memory: 32,571,392
	Page contains (324) HTMLNodes.
	Page <TITLE> element is: 补办居民身份证_办事指南_中国政府网
	Article body contains (148) HTMLNodes.
	Article contains (7) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0003 of 0006, 0005 of 0009]  - http://www.gov.cn/zhengce/content/2008-03/28/content_6190.htm
Available Memory: 28,077,432	Total Memory: 32,571,392
	Page contains (907) HTMLNodes.
	Page <TITLE> element is: 现役军人和人民武装警察居民身份证申领发放办法_政府信息公开专栏
	ArticleGet.apply(...) failed: The article or page whose content-body you have tried to retrieve using your ArticleGet produced null.
Factory-generated ArticleGet had called: InnerTagGetInclusive.first(page, "div", "class", TextComparitor.CONTAINS_CSS_CLASS_OR, "article"))
Exception Cause Chain:
THROWN: Torello.HTML.Tools.NewsSite.ArticleGetException
throwable.getLocalizedMessage():	[The article or page whose content-body you have tried to retrieve using your ArticleGet produced null.
Factory-generated ArticleGet had called: InnerTagGetInclusive.first(page, "div", "class", TextComparitor.CONTAINS_CSS_CLASS_OR, "article"))]
throwable.getMessage():		[The article or page whose content-body you have tried to retrieve using your ArticleGet produced null.
Factory-generated ArticleGet had called: InnerTagGetInclusive.first(page, "div", "class", TextComparitor.CONTAINS_CSS_CLASS_OR, "article"))]
0229: Torello.HTML.Tools.NewsSite.ArticleGet.lambda$usual$5c6ebc7$1()
0255: Torello.HTML.Tools.NewsSite.ScrapeArticles.download()
0054: CNB.main()


Visiting URL: [0003 of 0006, 0006 of 0009]  - http://www.gov.cn/fuwu/2017-01/06/content_5157060.htm
Available Memory: 28,116,800	Total Memory: 32,571,392
	Page contains (455) HTMLNodes.
	Page <TITLE> element is: 《前往港澳通行证》办理指南_办事指南_中国政府网
	Article body contains (269) HTMLNodes.
	Article contains (6) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0003 of 0006, 0007 of 0009]  - http://www.gov.cn/home/2014-02/18/content_5046260.htm
Available Memory: 28,089,632	Total Memory: 32,571,392
	Page contains (1337) HTMLNodes.
	Page <TITLE> element is: 中国政府网微博、微信_附件_中国政府网
	Article body contains (1139) HTMLNodes.
	Article contains (111) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0003 of 0006, 0008 of 0009]  - http://www.gov.cn/guoqing/2018-06/22/content_5300522.htm
Available Memory: 28,059,304	Total Memory: 32,571,392
	Page contains (326) HTMLNodes.
	Page <TITLE> element is: 国务院研究室_中国概况_中国政府网
	Article body contains (156) HTMLNodes.
	Article content contained 0 HTML IMG elements

Visiting URL: [0005 of 0006, 0000 of 0009]  - http://www.gov.cn/guoqing/2012-04/20/content_2582698.htm
Available Memory: 28,040,576	Total Memory: 32,571,392
	Page contains (2446) HTMLNodes.
	Page <TITLE> element is: 2010年第六次全国人口普查主要数据公报_中国概况_中国政府网
	Article body contains (2257) HTMLNodes.
	Article content contained 0 HTML IMG elements

Visiting URL: [0005 of 0006, 0001 of 0009]  - http://www.gov.cn/xinwen/2018-04/03/content_5279419.htm
Available Memory: 28,124,680	Total Memory: 32,571,392
	Page contains (597) HTMLNodes.
	Page <TITLE> element is: 中国保障宗教信仰自由的政策和实践_滚动新闻_中国政府网
	Article body contains (419) HTMLNodes.
	Article contains (1) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0005 of 0006, 0002 of 0009]  - http://www.gov.cn/xinwen/2020-02/28/content_5484361.htm
Available Memory: 28,090,544	Total Memory: 32,571,392
	Page contains (1684) HTMLNodes.
	Page <TITLE> element is: 中华人民共和国2019年国民经济和社会发展统计公报_部门政务_中国政府网
	Article body contains (1492) HTMLNodes.
	Article contains (39) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0005 of 0006, 0003 of 0009]  - http://www.gov.cn/xinwen/2019-02/28/content_5369270.htm
Available Memory: 28,056,592	Total Memory: 32,571,392
	Page contains (1594) HTMLNodes.
	Page <TITLE> element is: 
2018年国民经济和社会发展统计公报

_
部门政务
_中国政府网
	Article body contains (1412) HTMLNodes.
	Article contains (38) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0005 of 0006, 0004 of 0009]  - http://www.gov.cn/xinwen/2018-02/28/content_5269506.htm
Available Memory: 28,019,072	Total Memory: 32,571,392
	Page contains (1588) HTMLNodes.
	Page <TITLE> element is: 
中华人民共和国2017年国民经济和社会发展统计公报

_
部门政务
_中国政府网
	Article body contains (1406) HTMLNodes.
	Article contains (32) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0005 of 0006, 0005 of 0009]  - http://www.gov.cn/xinwen/2017-02/28/content_5171643.htm
Available Memory: 28,103,976	Total Memory: 32,571,392
	Page contains (1472) HTMLNodes.
	Page <TITLE> element is: 统计局发布2016年国民经济和社会发展统计公报_部门政务_中国政府网
	Article body contains (1298) HTMLNodes.
	Article contains (41) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0005 of 0006, 0006 of 0009]  - http://www.gov.cn/xinwen/2016-02/29/content_5047274.htm
Available Memory: 28,069,432	Total Memory: 32,571,392
	Page contains (1394) HTMLNodes.
	Page <TITLE> element is: 2015年国民经济和社会发展统计公报_部门新闻_中国政府网
	Article body contains (1207) HTMLNodes.
	Article contains (32) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0005 of 0006, 0007 of 0009]  - http://www.gov.cn/xinwen/2015-02/26/content_2822060.htm
Available Memory: 28,035,136	Total Memory: 32,571,392
	Page contains (335) HTMLNodes.
	Page <TITLE> element is: 统计局发布2014年国民经济和社会发展统计公报_部门新闻_中国政府网
	Article body contains (147) HTMLNodes.
	Article contains (2) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.

Visiting URL: [0005 of 0006, 0008 of 0009]  - http://www.gov.cn/xinwen/2019-09/27/content_5434223.htm
Available Memory: 28,016,552	Total Memory: 32,571,392
	Page contains (3831) HTMLNodes.
	Page <TITLE> element is: 中华人民共和国大事记(1949年10月-2019年9月)_滚动新闻_中国政府网
	Article body contains (3643) HTMLNodes.
	Article contains (2) image TagNodes.
ARTICLE LOADED.  Sending to ScrapedArticleReceiver.
*****************************************************************************************
Traversing Site Completed.
Loaded a total of (49) articles.