SAFETYLIT WEEKLY UPDATE

We compile citations and summaries of about 400 new articles every week.
RSS Feed

HELP: Tutorials | FAQ
CONTACT US: Contact info

Search Results

Journal Article

Citation

D'Andrea E, Ducange P, Lazzerini B, Marcelloni F. IEEE Trans. Intel. Transp. Syst. 2015; 16(4): 2269-2283.

Copyright

(Copyright © 2015, IEEE (Institute of Electrical and Electronics Engineers))

DOI

10.1109/TITS.2015.2404431

PMID

unavailable

Abstract

Social networks have been recently employed as a source of information for event detection, with particular reference to road traffic congestion and car accidents. In this paper, we present a real-time monitoring system for traffic event detection from Twitter stream analysis. The system fetches tweets from Twitter according to several search criteria; processes tweets, by applying text mining techniques; and finally performs the classification of tweets. The aim is to assign the appropriate class label to each tweet, as related to a traffic event or not. The traffic detection system was employed for real-time monitoring of several areas of the Italian road network, allowing for detection of traffic events almost in real time, often before online traffic news web sites. We employed the support vector machine as a classification model, and we achieved an accuracy value of 95.75% by solving a binary classification problem (traffic versus non-traffic tweets). We were also able to discriminate if traffic is caused by an external event or not, by solving a multi-class classification problem and obtaining an accuracy value of 88.89%.

SafetyLit Note:
There are several journal articles that were published after this was released (August 2015) that are near word-for-word duplicates of this one. It is worth noting that this seemingly original article was not cited in the near-duplicates. That is not to say that all current Twitter stream analysis have been inspired by this article without attribution.

SafetyLit note: We believe that the inclusion of references in this case falls under fair use. Why? 1) This article was published as open access and the journal and copyright owner is acknowledged; 2) a link to full text is provided; 3) reproduction of this reference list is relevant to a commentary on what is and is not original thought and 4) the items in this reference list are part of the data upon which an investigation are based.

References

1. F. Atefeh and W. Khreich, "A survey of techniques for event detection in Twitter", Comput. Intell., vol. 31, no. 1, pp. 132-164, 2015.

2. P. Ruchi and K. Kamalakar, "ET: Events from tweets", Proc. 22nd Int. Conf. World Wide Web Comput., pp. 613-620, 2013.

3. A. Mislove, M. Marcon, K. P. Gummadi, P. Druschel and B. Bhattacharjee, "Measurement and analysis of online social networks", Proc. 7th ACM SIGCOMM Conf. Internet Meas., pp. 29-42, 2007.


4. G. Anastasi, "Urban and social sensing for sustainable mobility in smart cities", Proc. IFIP/IEEE Int. Conf. Sustainable Internet ICT Sustainability, pp. 1-4, 2013.



5. A. Rosi, "Social sensors and pervasive services: Approaches and perspectives", Proc. IEEE Int. Conf. PERCOM Workshops, pp. 525-530, 2011.



6. T. Sakaki, M. Okazaki and Y. Matsuo, "Tweet analysis for real-time event detection and earthquake reporting system development", IEEE Trans. Knowl. Data Eng., vol. 25, no. 4, pp. 919-931, Apr. 2013.



7. J. Allan, Topic Detection and Tracking: Event-Based Information Organization, 2002, Kluwer.



8. K. Perera and D. Dias, "An intelligent driver guidance tool using location based services", Proc. IEEE ICSDM, pp. 246-251, 2011.



9. T. Sakaki, Y. Matsuo, T. Yanagihara, N. P. Chandrasiri, and K. Nawa, "Real-time event extraction for driving information from social sensors", Proc. IEEE Int. Conf. CYBER, pp. 221-226, 2012.



10. B. Chen and H. H. Cheng, "A review of the applications of agent technology in traffic and transportation systems", IEEE Trans. Intell. Transp. Syst., vol. 11, no. 2, pp. 485-497, Jun. 2010.



11. A. Gonzalez, L. M. Bergasa and J. J. Yebes, "Text detection and recognition on traffic panels from street-level imagery using visual appearance", IEEE Trans. Intell. Transp. Syst., vol. 15, no. 1, pp. 228-238, Feb. 2014.



12. N. Wanichayapong, W. Pruthipunyaskul, W. Pattara-Atikom and P. Chaovalit, "Social-based traffic information extraction and classification", Proc. 11th Int. Conf. ITST, pp. 107-112, 2011.


13. P. M. d'Orey and M. Ferreira, "ITS for sustainable mobility: A survey on applications and impact assessment tools", IEEE Trans. Intell. Transp. Syst., vol. 15, no. 2, pp. 477-493, Apr. 2014.



14. K. Boriboonsomsin, M. Barth, W. Zhu and A. Vu, "Eco-routing navigation system based on multisource historical and real-time traffic information", IEEE Trans. Intell. Transp. Syst., vol. 13, no. 4, pp. 1694-1704, Dec. 2012.


15. J. Hurlock and M. L. Wilson, "Searching twitter: Separating the tweet from the chaff", Proc. 5th AAAI ICWSM, pp. 161-168, 2011.


16. J. Weng and B.-S. Lee, "Event detection in Twitter", Proc. 5th AAAI ICWSM, pp. 401-408, 2011.


17. S. Weiss, N. Indurkhya, T. Zhang and F. Damerau, Text Mining: Predictive Methods for Analyzing Unstructured Information, 2004, Springer-Verlag.


18. A. Hotho, A. Nürnberger and G. Paaß, "A brief survey of text mining", LDV Forum-GLDV J. Comput. Linguistics Lang. Technol., vol. 20, no. 1, pp. 19-62, May 2005.


19. V. Gupta, S. Gurpreet and S. Lehal, "A survey of text mining techniques and applications", J. Emerging Technol. Web Intell., vol. 1, no. 1, pp. 60-76, Aug. 2009.


20. V. Ramanathan and T. Meyyappan, "Survey of text mining", Proc. Int. Conf. Technol. Bus. Manage., pp. 508-514, 2013.


21. M. W. Berry and M. Castellanos, Survey of Text Mining, 2004, Springer-Verlag.



22. H. Takemura and K. Tajima, "Tweet classification based on their lifetime duration", Proc. 21st ACM Int. CIKM, pp. 2367-2370, 2012.



23. No reference is included for item 23


24. A. Schulz, P. Ristoski and H. Paulheim, "I see a car crash: Real-time detection of small scale incidents in microblogs" in The Semantic Web: ESWC 2013 Satellite Events, vol. 7955, pp. 22-33, 2013, Springer-Verlag.


25. M. Krstajic, C. Rohrdantz, M. Hund and A. Weiler, "Getting there first: Real-time detection of real-world incidents on Twitter", Proc. 2nd IEEE Work Interactive Vis. Text Anal.—Task-Driven Anal. Soc. Media IEEE VisWeek, 2012.


26. C. Chew and G. Eysenbach, "Pandemics in the age of Twitter: Content analysis of tweets during the 2009 H1N1 outbreak", PLoS ONE, vol. 5, no. 11, pp. 1-13, Nov. 2010.



27. B. De Longueville, R. S. Smith and G. Luraschi, "OMG from here I can see the flames!: A use case of mining location based social networks to acquire spatio-temporal data on forest fires", Proc. Int. Work. LBSN, pp. 73-80, 2009.



28. J. Yin, A. Lampert, M. Cameron, B. Robinson and R. Power, "Using social media to enhance emergency situation awareness", IEEE Intell. Syst., vol. 27, no. 6, pp. 52-59, Nov./Dec. 2012.



29. P. Agarwal, R. Vaithiyanathan, S. Sharma and G. Shro, "Catching the long-tail: Extracting local news events from Twitter", Proc. 6th AAAI ICWSM, pp. 379-382, Jun. 2012.


30. F. Abel, C. Hauff, G.-J. Houben, R. Stronkman and K. Tao, "Twitcident: fighting fire with information from social web streams", Proc. ACM 21st Int. Conf. Comp. WWW, pp. 305-308, 2012.



31. R. Li, K. H. Lei, R. Khadiwala and K. C.-C. Chang, "TEDAS: A Twitter-based event detection and analysis system", Proc. 28th IEEE ICDE, pp. 1273-1276, 2012.



32. M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann and I. H. Witten, "The WEKA data mining software: An update", SIGKDD Explor. Newsl., vol. 11, no. 1, pp. 10-18, Jun. 2009.


33. M. Habibi, Real World Regular Expressions with Java 1.4., 2004, Springer-Verlag.


34. Y. Zhou and Z.-W. Cao, "Research on the construction and filter method of stop-word list in text preprocessing", Proc. 4th ICICTA, vol. 1, pp. 217-221, 2011.


35. W. Francis and H. Kucera, "Frequency analysis of English usage: Lexicon and grammar", J. English Linguistics, vol. 18, no. 1, pp. 64-70, Apr. 1982.


36. M. F. Porter, "An algorithm for suffix stripping", Program: Electron. Library Inf. Syst., vol. 14, no. 3, pp. 130-137, 1980.



37. G. Salton and C. Buckley, "Term-weighting approaches in automatic text retrieval", Inf. Process. Manage., vol. 24, no. 5, pp. 513-523, Aug. 1988.



38. L. M. Aiello, "Sensing trending topics in Twitter", IEEE Trans. Multimedia, vol. 15, no. 6, pp. 1268-1282, Oct. 2013.



39. C. Shang, M. Li, S. Feng, Q. Jiang and J. Fan, "Feature selection via maximizing global information gain for text classification", Knowl.-Based Syst., vol. 54, pp. 298-309, Dec. 2013.



40. L. H. Patil and M. Atique, "A novel feature selection based on information gain using WordNet", Proc. SAI Conf., pp. 625-629, 2013.


41. M. A. Hall and G. Holmes, "Benchmarking attribute selection techniques for discrete class data mining", IEEE Trans. Knowl. Data Eng., vol. 15, no. 6, pp. 1437-1447, Nov./Dec. 2003.



42. H. Uğuz, "A two-stage feature selection method for text categorization by using information gain principal component analysis and genetic algorithm", Knowl.-Based Syst., vol. 24, no. 7, pp. 1024-1032, Oct. 2011.


43. Y. Aphinyanaphongs, "A comprehensive empirical comparison of modern supervised classification and feature selection methods for text categorization", J. Assoc. Inf. Sci. Technol., vol. 65, no. 10, pp. 1964-1987, Oct. 2014.


44. J. Platt, B. Schoelkopf, C. J. C. Burges and A. J. Smola, "Fast training of support vector machines using sequential minimal optimization" in Advances in Kernel Methods: Support Vector Learning, pp. 185-208, 1999, MIT Press.

45. G. H. John and P. Langley, "Estimating continuous distributions in Bayesian classifiers", Proc. 11th Conf. Uncertainty Artif. Intell., pp. 338-345, 1995.


46. J. R. Quinlan, C4.5: Programs for Machine Learning, 1993, Morgan Kaufmann.


47. D. W. Aha, D. Kibler and M. K. Albert, "Instance-based learning algorithms", Mach. Learn., vol. 6, no. 1, pp. 37-66, Jan. 1991.


48. E. Frank and I. H. Witten, "Generating accurate rule sets without global optimization", Proc. 15th ICML, pp. 144-151, 1998.


49. C. Cortes and V. Vapnik, "Support-vector networks", Mach. Learn., vol. 20, no. 3, pp. 273-297, Sep. 1995.


50. T. T. Cover and P. E. Hart, "Nearest neighbour pattern classification", IEEE Trans. Inf. Theory, vol. IT-13, no. 1, pp. 21-27, Jan. 1967.


51. W. W. Cohen, "Fast effective rule induction", Proc. 12th ICML, pp. 115-123, 1995.


52. G. Pagallo and D. Haussler, "Boolean feature discovery in empirical learning", Mach. Learn., vol. 5, no. 1, pp. 71-99, Mar. 1990.


53. J. Derrac, S. Garcia, D. Molina and F. Herrera, "A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms", Swarm Evol. Comput., vol. 1, no. 1, pp. 3-18, Mar. 2011.


54. F. Wilcoxon, "Individual comparisons by ranking methods", Biometrics Bull., vol. 1, no. 6, pp. 80-83, Dec. 1945.


55. H. Becker, M. Naaman and L. Gravano, "Beyond trending topics: Real-world event identification on Twitter", Proc. 5th AAAI ICWSM, pp. 438-441, 2011. 56. H. Kwak, C. Lee, H. Park and S. Moon, "What is Twitter a social network or a news media?", Proc. ACM 19th Int. Conf. WWW, pp. 591-600, 2010.



Keywords: Twitter-Traffic-Status


Language: en

NEW SEARCH


All SafetyLit records are available for automatic download to Zotero & Mendeley
Print