[1] Daly J. TRB Webinar: Learning About and Using the Research in Progress (RiP) Database 2016:14. http://www.trb.org/ElectronicSessions/Blurbs/174599.aspx.
[2] Gopalakrishnan K, Khaitan SK. TEXT MINING TRANSPORTATION RESEARCH GRANT BIG DATA: KNOWLEDGE EXTRACTION AND PREDICTIVE MODELING USING FAST NEURAL NETS. Int J TRAFFIC Transp Eng 2017;7. doi:10.7708/ijtte.2017.7(3).06.
[3] Foster DP, Liberman M, Stine RA. Featurizing Text: Converting Text into Predictors for Regression Analysis. Whart Sch Univ Pennsylvania, Philadelphia, PA 2013.
[4] Argamon S, Koppel M, Pennebaker JW, Schler J. Automatically profiling the author of an anonymous text. Commun ACM 2009;52:119–23.
[5] Schwartz HA, Eichstaedt JC, Kern ML, Dziurzynski L, Ramones SM, Agrawal M, et al. Personality, gender, and age in the language of social media: The open-vocabulary approach. PLoS One 2013;8:e73791.
[6] Rosenthal S, McKeown K. Age prediction in blogs: A study of style, content, and online behavior in pre-and post-social media generations. Proc. 49th Annu. Meet. Assoc. Comput. Linguist. Hum. Lang. Technol. 1, Association for Computational Linguistics; 2011, p. 763–72.
[7] Nguyen D, Smith NA, Rosé CP. Author age prediction from text using linear regression. Proc. 5th ACL-HLT Work. Lang. Technol. Cult. Heritage, Soc. Sci. Humanit., Association for Computational Linguistics; 2011, p. 115–23.
[8] Joshi M, Das D, Gimpel K, Smith NA. Movie reviews and revenues: An experiment in text regression. Hum. Lang. Technol. 2010 Annu. Conf. North Am. Chapter Assoc. Comput. Linguist., Association for Computational Linguistics; 2010, p. 293–6.
[9] Singhal A, Kasturi R, Srivastava J. Automating Document Annotation Using Open Source Knowledge. 2013 IEEE/WIC/ACM Int. Jt. Conf. Web Intell. Intell. Agent Technol., vol. 1, IEEE; 2013, p. 199–204. doi:10.1109/WI-IAT.2013.30.
[10] Singhal A, Srivastava J. Research dataset discovery from research publications using web context. Web Intell 2017;15:81–99. doi:10.3233/WEB-170354.
[11] Blei DM, Ng AY, Jordan MI. Latent dirichlet allocation. J Mach Learn Res 2003;3:993–1022.
[12] Landauer TK. Latent Semantic Analysis. Encycl. Cogn. Sci., Chichester: John Wiley & Sons, Ltd; 2006. doi:10.1002/0470018860.s00561.
[13] Le Q, Mikolov T. Distributed representations of sentences and documents. Int. Conf. Mach. Learn., 2014, p. 1188–96.
[14] Witten IH, Frank E, Hall MA, Pal CJ. Data Mining: Practical machine learning tools and techniques. Morgan Kaufmann; 2016.
[15] Breiman L. Random Forests. Mach Learn 2001;45:5–32. doi:10.1023/A:1010933404324.
[16] Holte RC. Very Simple Classification Rules Perform Well on Most Commonly Used Datasets. Mach Learn 1993;11:63–90. doi:10.1023/A:1022631118932.
[17] Lai T., Robbins H, Wei C. Strong consistency of least squares estimates in multiple regression II. J Multivar Anal 1979;9:343–61. doi:10.1016/0047-259X(79)90093-9.