Gender Distribution across Topics in the Top 5 Economics JournalsA Machine Learning Approach

  1. J.Ignacio Conde-Ruiz 1
  2. Juan-José Ganuza 2
  3. Manu García 3
  4. Luis A. Puch 4
  1. 1 Fedea Universidad Complutense de Madrid and ICAE
  2. 2 Universitat Pompeu Fabra and Barcelona GSE
  3. 3 Washington University in St. Louis and ICAE
  4. 4 Universidad Complutense de Madrid and ICAE
Revista:
Documentos de Trabajo (ICAE)

ISSN: 2341-2356

Año de publicación: 2021

Número: 9

Páginas: 1-54

Tipo: Documento de Trabajo

Otras publicaciones en: Documentos de Trabajo (ICAE)

Resumen

We analyze all the articles published in the top five (T5) Economics journals between 2002 and 2019 in order to find gender differences in their research approach. We implement an unsupervised machine learning algorithm: the Structural Topic Model (STM), so as to incorporate gender document-level metadata into a probabilistic text model. This algorithm characterizes jointly the set of latent topics that best fits our data (the set of abstracts) and how the documents/abstracts are allocated to each latent topic. Latent topics are mixtures over words where each word has a probability of belonging to a topic after controlling by journal name and publication year (the meta-data). Thus, the topics may capture research fields but also other more subtle characteristics related to the way in which the articles are written. We find that fe- males are unevenly distributed along the estimated latent topics, by using only data driven methods. This finding relies on “automatically” generated built-in data given the contents in the abstracts of the articles in the T5 journals, without any arbitrary allocation of texts to particular categories (as JEL codes, or research areas).

Referencias bibliográficas

  • Bagues, Manuel and Pamela Campa, “Can Gender Quotas in Candidate Lists EmpowerWomen? Evidence from a Regression Discontinuity Design,” 2017, (12149).
  • Bayer, Amanda and Cecilia E. Rouse, “Diversity in the Economics Profession: A New Attack on an Old Problem,” Journal of Economic Perspectives, Nov. 2016, 30 (4), 221–42.
  • Beneito, P., J. E. Bosc´a, J. Ferri, and M. García, “Women across Subfields in Economics: Relative Performance and Beliefs,” Fedea WP, June 2018, (2018 - 06).
  • Blei, David M., Andrew Y. Ng, and Michael I. Jordan, “Latent Dirichlet Allocation,” J. Mach. Learn. Res., March 2003, 3 (null), 993 – 1022.
  • Boustan, Leah and Andrew Langan, “Variation in Women’s Success across PhD Programs in Economics,” Journal of Economic Perspectives, February 2019, 33 (1), 23–42.
  • Buckley, Chris, “Implementation of the SMART Information Retrieval System,” Technical Report, USA 1985.
  • Cabrales, A., M. García, and L. A. Puch, “Gendered Language in the British Press,” Mimeo COSME Gender, at 2018 Meetings of the Spanish Economic Association, 2018.
  • Card, David and Stefano DellaVigna, “Nine Facts about Top Journals in Economics,” Journal of Economic Literature, March 2013, 51 (1), 144–61.
  • Card, David and Stefano DellaVigna, Patricia Funk, and Nagore Iriberri, “Are Referees and Editors in Economics Gender Neutral?,” The Quarterly Journal of Economics, 11 2019, 135 (1), 269–327.
  • Chari, Anusha and Paul Goldsmith-Pinkham, “Gender Representation in Economics Across Topics and Time: Evidence from the NBER Summer Institute,” Working Paper 23953, National Bureau of Economic Research October 2017.
  • Chevalier, Judy, “The 2020 Report of the Committee on the Status of Women in the Economics Profession,” 2020.
  • Conde-Ruiz, J. Ignacio, Juan-José Ganuza, and Paola Profeta, “Statistical Discrimination and the Efficiency of Quotas,” Fedea Working Papers, 2017.
  • Conde-Ruiz, Juan José Ganuza, and Paola Profeta, “Statistical Discrimination and Committees,” Fedea Working Papers, February 2021, (2021-06).
  • Dolado, Juan, Florentino Felgueroso, and Miguel Almunia, “Are men and womeneconomists evenly distributed across research fields? Some new empirical evidence,” SERIEs: Journal of the Spanish Economic Association, September 2012, 3 (3), 367–393.
  • Hansen, Stephen, Michael McMahon, and Andrea Prat, “Transparency and Deliberation Within the FOMC: A Computational Linguistics Approach,” The Quarterly Journal of Economics, 10 2017, 133 (2), 801–870.
  • Heckman, James J. and Sidharth Moktan, “Publishing and Promotion in Economics: The Tyranny of the Top Five,” Journal of Economic Literature, June 2020, 58 (2), 419–70.
  • Hengel, E., “Publishing while Female. Are women held to higher standards? Evidence from peer review,” Cambridge Working Papers in Economics 1753, Faculty of Economics, University of Cambridge December 2020.
  • Hengel, Erin and Eunyoung Moon, “Gender and quality at top economics journals,” Working Papers 202001, University of Liverpool, Dept. of Economics February 2020.
  • Lundberg, Shelly and Jenna Stearns, “Women in Economics: Stalled Progress,” Journal of Economic Perspectives, February 2019, 33 (1), 3–22.
  • Mimno, David, Hanna Wallach, Edmund Talley, Miriam Leenders, and Andrew McCallum, “Optimizing Semantic Coherence in Topic Models,” 2011, pp. 262 – 272.
  • Roberts, Margaret E., Brandon M. Stewart, and Dustin Tingley, “stm: An R Package for Structural Topic Models,” Journal of Statistical Software, 2019, 91 (2), 1–40.
  • Roberts, Margaret E., Brandon M. Stewart, Christopher Lucas, Jetson Leder-Luis, Shana Kushner Gadarian, Bethany Albertson, and David G. Rand, “Structural Topic Models for Open-Ended Survey Responses,” American Journal of Political Science, 2014, 58 (4), 1064–1082.
  • Siniscalchi, Marciano and Pietro Veronesi, “Self-image Bias and Lost Talent,” December 2020, (28308).
  • Tang, Cong, Keith Ross, Nitesh Saxena, and Ruichuan Chen, “What’s in a Name: A Study of Names, Gender Inference and Gender Behavior in Facebook,” in “Xu J., Yu G., Zhou S., Unland R. (eds) Database Systems for Advanced Applications Lecture Notes in Computer Science, vol 6637,” Springer Berlin Heidelberg, 2011, pp. 344 – 356.