Multi-Dimensional Description and Spatio-temporal Visualization of News Events Based on RSS

Expand
  • 1. Geomatics College, Shandong University of Science and Technology, Qingdao 266590, China;
    2. Satellite Surveying and Mapping Application Center, State Bureau of Surveying and Mapping, Beijing 101300, China;
    3. Wuhan University, Wuhan 430072, China

Received date: 2013-08-13

  Revised date: 2013-11-18

  Online published: 2014-05-10

Abstract

Traditional methods of news retrieval which return a series of related news-list that sorted by time or events such as Baidu, are lack of intuitive description in both temporal and spatial dimensions, as well as spatio-temporal development that related to news events. This paper presented a method of multi-dimensional description and spatio-temporal visualization of online RSS news events, which helps readers understand the spatio-temporal development of the whole news event. Firstly, this method pulled news from several well-known websites such as Baidu, Sina and Google News based on RSS (Really Simple Syndication) service, and then used a multi-dimensional description method to mark the spatial and temporal dimensions of RSS news. The method of temporal dimensional description defines news publishing time as news' occurrence time, while the method of spatial dimensional description dynamically parses and identifies Chinese geographical name from news description, and then matches them with their geographical coordinates. Spatial dimensional description method is the primary content of this article. This approach has been separated into four stages to accomplish the analyzing process: (i) XSL Transformation, which uses XSL(eXtensible Stylesheet Language) to transform a news RSS document into a HTML(Hypertext Markup Language) document;(ii) Description Extraction, which uses the regular expression to extract the news description from news HTML document;(iii) Chinese place Name Extraction, which uses ICTCLAS to extract geographic name from description;And (iv) Geocoding, which uses Google Geocoder API to get the geographical coordinates of the place name. At last, this paper demonstrated the spatio-temporal visualization of news events and made a brief analysis by setting H7N9 hot news as an example. In the analysis, temporal visualization used transition color to show the changes between two time nodes according to the amount of news, and then used line chart to show the variation tendency of the total amount of news. Furthermore, spatial visualization clustered news by province and used different-sized plots to indicate the diffidence of news amounts between two provinces.

Cite this article

SANG Peng, TANG Xinming, AI Bo, WANG Huabin . Multi-Dimensional Description and Spatio-temporal Visualization of News Events Based on RSS[J]. Journal of Geo-information Science, 2014 , 16(3) : 341 -348 . DOI: 10.3724/SP.J.1047.2014.00341

References

[1] Weskamp M. Newsmap[DB/OL]. 2013-03-04. http://www.marumushi.com/apps/newsmap.
[2] Mod C. Buzztracker-World News[DB/OL]. 2013-03-04. http://www.buzztracker.org.
[3] 刘晓娟,陈嘉勇,刘世希.文本可视化在新闻事件演变中的应用[J].图书情报工作,2010,54(18):67-71.
[4] Bradshaw P.Yahoo Tracker by FlatFeetPete[DB/OL].2013-03-04.http://www.flatfeetpete.com/ytrack/index.html.
[5] Zuylen C V. From documents to information: A new model for information retrieval[EB/OL]. 2013-03-04. http:/www.inxight.com/pdfs/TimeWall_FinalPrint.pdf.
[6] Havre S, Nowell L. ThemeRiver: Visualizing theme changes over time[J]. Proceedings of the IEEE Symposium on Information Visualization, 2000(10):115-123.
[7] Havre S, Hetzler E, Whitney P, et al. ThemeRiver: Visualization thematic changes in large document collections[J]. Proceedings of the IEEE Transactions on Visualization and Computer Graphics, 2002,18(1):9-20.
[8] 安海岗.专题新闻文本集信息可视化理论模型及实证研究[J].情报杂志,2012,31(8):37-43.
[9] Card S K, Mackinlay J D, Shneiderman B. Reading in information visualization: Using vision to think[M]. San Francisco, CA: Morgan Kaufmann, 1999.
[10] Yang C C, Shi X, Wei C P. Trancing the event evolution of terror attacks from online news[C]. Proceeding of the IEEE International Conference on Intelligence and Security Informatics Berlin, 2006,343-354.
[11] 梁家豪.结合事件主轴摘要之议题回顾机制与新闻报道应用[D].高雄:台湾国立中山大学,2005.
[12] 卢耀素.GIS 可视化互联网新闻搜索引擎的研究与实现[D].武汉:华中科技大学,2006.
[13] 苏哲.RSS 新闻聚合型网站的数据分析系统[D].北京:北京交通大学,2010.
[14] 王婷.基于RSS技术的在线RSS新闻阅读器的设计与实现[J].硅谷,2011(18):51-52.
[15] Jaiswal A, Pezanowski P, Mitra P, et al. GeoCAM: A geovisual analytics workspace to contextualize and interpret statements about movement[J]. Journal of Spatial Information Science, 2011(3):65-101.
[16] Tomaszewski B. Developing geo-temporal context from implicit source with geovisual analytics[J]. Computers & Geosciences, 2011(1):86-92.
[17] 邓莉琼,吴玲达,陈丹雯,等.基于OpenGL 的时空信息可视化系统设计与实现[J]. 系统仿真学报,2009,21(1): 163-165.
[18] 张立鑫.基于Silverlight 开发的RSS 聚合系统的设计与实现[J].电脑知识与技术,2011,7(5):1065-1068.
[19] 谢倩堃.RSS 新闻的更新特征分析及RSSReader 的订阅模型[D].北京:北京交通大学,2008.
[20] 王鹏,张永奎,张彦,等.基于新闻网页主体要素的网页去重方法研究[J].计算机工程与应用,2007,43(8):177-180.
[21] Mubareka S, Khudhairy D A, Bonn F, et al. Standardising and mapping open-source information for crisis regions: the case of post-conflict Iraq[J]. Disasters,2005,29(3): 237-254.
[22] 周顺平,王海龙,于海燕.使用XSL表现XML的几种方法[J].计算机与现代化,2002(5):7-10.
[23] 李文华,杨亚仿,吴昊.基于正则表达式的HTML信息提取[J].电脑开发与应用2012,25(4):44-46.
[24] 刘群,张华平,俞鸿魁,等.基于层次隐马模型的汉语词法分析[J].计算机研究与发展,2004,41(8):1421-1429.
[25] 徐春玉.好奇心理学[M].杭州:浙江教育出版社,2008.
[26] 艾波,唐新明,艾廷华,等.利用透明度进行时空信息可视化[J].武汉大学学报:信息科学版,2012,37(2):229-232.

Outlines

/