DIVA software

 

CoauthorAnalysis

Page history last edited by Steven Morris 1 yr ago

Coauthorship analysis

 

Analysis of paper authors is useful for mapping researchers and research teams in a specialty. Clustering is accomplished by building a co-occurrence matrix of paper authors that lists the number of common papers for each pair of authors, i.e., a matrix of co-authorship counts among all pairs of paper authors in the dataset. Similarities based on these co-authorship counts are used as proximities in the clustering routine. Such clustering tends to identify the research teams in the specialty. Normally, only highly productive authors are included in the analysis.
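As a rough illustration of what a matrix of co-authorship counts contains, the following Python sketch counts the papers shared by each pair of authors. It is not DIVA code: the function name coauthorship_counts, the dictionary layout, and the toy dataset are all assumptions made for this example, and DIVA itself builds the matrix inside MATLAB.

```python
from collections import Counter
from itertools import combinations


def coauthorship_counts(papers):
    """Count the papers shared by every pair of authors.

    `papers` maps a paper id to the list of its authors (hypothetical layout).
    Returns a Counter keyed by alphabetically sorted (author, author) pairs.
    """
    counts = Counter()
    for authors in papers.values():
        # Deduplicate so a name repeated on one paper is counted once.
        for pair in combinations(sorted(set(authors)), 2):
            counts[pair] += 1
    return counts


# Hypothetical toy dataset: three papers, three authors.
papers = {
    "p1": ["Adams", "Baker"],
    "p2": ["Adams", "Baker", "Chen"],
    "p3": ["Chen"],
}
counts = coauthorship_counts(papers)
print(counts[("Adams", "Baker")])  # 2 (papers p1 and p2)
print(counts[("Baker", "Chen")])   # 1 (paper p2)
```

Each entry corresponds to one off-diagonal cell of the symmetric co-authorship matrix. Single-author papers such as p3 contribute nothing, because they contain no author pairs.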

 

Loading the paper to paper author matrix

 

  • Prior to co-authorship clustering, it is necessary to load the paper to paper author matrix from the database. To do this, in the TFGUI window select Matrix > load paper to paper author matrix. In the MATLAB command window, a few processing messages will appear, followed by a final message: done loading paper-paper-author matrix.

 

Clustering paper authors

 

  • The paper to paper author matrix must be loaded into DIVA before coauthorship clustering can be performed. See the previous paragraph.
  • In the TFGUI window, go to the Cooccurrence menu and select Use author co-occurrence default. The cooccur_gui will appear. The primary entity will be paper author and the secondary entity will be paper. This means that paper authors will be clustered on cooccurrence in papers.
  • If the number of paper authors to cluster is greater than the number of clusters specified in cooccur_gui, then DIVA will produce the number of clusters specified. Otherwise DIVA will cluster down to individual paper authors, whatever that number is.
  • It is normally best to cluster down to individual paper authors, with the number of paper authors between 50 and 200 depending on the size of the dataset. A good rule of thumb is to use 50 paper authors per 500 papers in the dataset.
  • In coauthorship clustering the occurrence threshold corresponds to the minimum number of papers an author has authored. Paper authors with more papers than the occurrence threshold are retained.
  • You should experiment with the occurrence threshold to get close to the desired number of retained paper authors.
  • Set the occurrence threshold, then click Execute. An overwrite dialog will appear; click OK. A second dialog will then report the number of items (paper authors) that will be clustered and ask whether to continue.
  • If the number of items is too few, click No, go back, and reduce the occurrence threshold; if too many, click No and increase the threshold. If the number of items that will be clustered is close to the desired number, click Yes and clustering will proceed.
  • Clustering will proceed quickly, but the simulated annealing used to seriate the dendrogram may take a very long time if many items are to be clustered.
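The threshold experiment described above can also be sketched outside DIVA. The Python below is a minimal illustration, not part of DIVA: all function names and the toy paper counts are hypothetical, clamping the rule of thumb to the 50 to 200 range is an assumption, and the strict "more than" comparison simply follows the wording above rather than any confirmed DIVA behavior.

```python
def suggested_author_count(n_papers, low=50, high=200):
    """Rule of thumb: 50 authors per 500 papers, kept within [low, high].

    Clamping to the 50-200 range is an assumption made for this sketch.
    """
    target = round(50 * n_papers / 500)
    return max(low, min(high, target))


def retained_authors(paper_counts, threshold):
    """Authors with more papers than the threshold (strict comparison)."""
    return [a for a, n in paper_counts.items() if n > threshold]


def closest_threshold(paper_counts, target):
    """Return the lowest threshold whose retained count is closest to target."""
    max_count = max(paper_counts.values(), default=0)
    return min(
        range(max_count + 1),
        key=lambda t: abs(len(retained_authors(paper_counts, t)) - target),
    )


# Hypothetical number of papers per author.
paper_counts = {"Adams": 5, "Baker": 3, "Chen": 3, "Diaz": 1}

print(suggested_author_count(1000))               # 100
print(closest_threshold(paper_counts, target=3))  # 1
print(retained_authors(paper_counts, 1))          # ['Adams', 'Baker', 'Chen']
```

Raising the threshold can only shrink the retained set, which is why too many items calls for a higher threshold and too few calls for a lower one.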

 

 
