Frequently Asked Questions
Who is behind CSIndexbr?
CSIndexbr is maintained by the Applied
Software Engineering Research Group, from Federal University of Minas Gerais, Brazil.
However, our results are not endorsed by UFMG.
We also cannot assure the absence of errors, inconsitencies,
and missing entries in the data provided by CSIndexbr. For this reason,
the use of our data is at your own risk.
How to cite CSIndexbr?
Please cite this paper:
CSIndexbr: Exploring the Brazilian Scientific Production in Computer Science.
Marco Tulio Valente, Klerisson Paixao.
arXiv abs/1807.09266, 2018.
You can also check the following papers that use our data:
- Uma Análise da Produção Científica Brasileira em Conferências de
Manutenção e Evolução de Software.
Klérisson Paixão; Marco Tulio Valente; Marcelo Maia.
6th Brazilian Workshop on Software Visualization,
Evolution and Maintenance, p. 1-8, 2018.
How CSIndexbr works?
Papers are collected from DBLP.
In the case of conferences, we only collect full papers, accepted in the
main research track of the event. We also collect citations from
and links to preprints from arXiv (see figure).
What is the license used by CSIndexbr?
Our code is available under a MIT
our data is available under
a CC BY-NC-SA 4.0
license. The latter grants free and non-commercial usage of our data,
including adaptations. However, proper credit should be granted,
by linking to CSIndexbr site and/or citing this
We also recommend to mention
since it is the primary source of the data collected by CSIndexbr.
What is your definition of a full paper?
It is a paper presented in the main track of a conference. We also apply a
minimum page size threshold, such as:
- Software Engineering, Programming Languages, Information Systems, Databases,
Web & Information Retrieval: 10 pages
- Computer Networks, Distributed Systems, Human-Computer Interaction: 8 pages
Do you plan to cover short papers (including demos, tool papers, early research
papers, industry track papers etc)?
No, in the case of conferences, we only collect full papers, describing
mature and carefully evaluated work; these papers
should be viewed as journal-quality papers.
Why my paper is not listed?
In the case of conferences, check if it is a full paper, published in the main
track of the event, in the last five years. If this the case, please
use this form
and provide data about your paper.
Which conferences are tracked?
We do not intend to cover all CS conferences. Our interest is on journal-quality
conferences, with good metrics and well-known sponsors (ACM SIGs, IEEE CS, etc).
Particularly, conferences should attend the following thresholds:
- submitted > 100 papers
- acceptance < 30%
- h5-index > 20
We also do not track multi-conferences, since it is not possible to
retrieve the h5-index of each of their conferences or tracks.
Furthermore, we set up a limit of 15 conferences per
research area (the exception is Computer Networks, with 18 conferences;
but this area also includes Mobile Computing).
Which journals are tracked?
First, journals must be indexed by Journal Citation Reports (JCR).
Second, we look for journals attending the following thresholds:
Furthermore, we set up a limit of 15 journals per research area (the exception
is Operational Research, where we are indexing 21 journals; however, this area
is the onely one that does not have conferences indexed by CSIndexbr). Finally,
there is also a limit of at most two magazines per research area.
- Why having a limit of 15 journals per area? Because our goal is
to index only the most important venues (journals or conferences) of
each area. To access the full list of publications of a researcher, please
check his/her DBLP page.
- Why JCR's impact factor is not used?
Because JCR has a restrict license; for example "sharing of JCR data
outside a subscriber’s institution is strictly prohibited".
Why conference [Conf] or [Journal] are not covered?
We accept suggestions to track other conferences or journals. However,
please check if they attend the thresholds defined in the previous
If you think this is the case, please
use this form
to inform the metrics about [Conf] or [Journal].
How do you define top-conferences?
First, top-conferences should have:
- submitted > 180 papers and
- h5-index > 30.
Moreover, whenever possible we define a limit of 3 top-conferences
per research area.
- Why some areas have more than 3 top-conferences? Because
they are a combination or include distint areas. For example, we have
five top-conferences in Computer Networks, but it
includes Mobile Computing, which has even a dedicated ACM SIG.
- Why not divide an area with more than 3 top-conferences
in areas X and Y?
Because there are journals that publish papers in X and Y.
How do you define top-journals?
We define as top-journals the Transactions published by ACM and IEEE
Computer Society (or IEEE Computational Intelligence Society, in the case of
AI journals; or IEEE Robotics and Automation Society, in the case of
Robotics journals; or INFORMS, in the case of Operational Research).
Moreover, there is a limit of 3 top-journals
per resarch area.
What is the meaning of "other" in the journals Rank column (Stats tab)?
These journals are scored as conferences (weight 0.33, instead of 0.40).
The following journals are classified in this category:
- Magazines and journals that accept short papers (which must have at least 6 pages)
- Journals that accept more than 500 papers/year
- Journals with normalized-h5-index < 0.2
How is the departments' score computed?
Using this formula:
- score = A + (0.4 * B) + (0.33 * C)
- A = number of papers in top-venues (journals or conferences)
- B = number of papers in journals
- C = number of papers in conferences, magazines, journals that accept
short papers, or in "other" journals
How are citations collected?
We collect citations using Crossref API.
is an official DOI registration agency. Whenever a publisher asks
Crossref for a DOI, it must provide several metadata about the paper being
registered, such as title, authors, date of publication, etc. Publishers
can also provide the DOIs of each reference contained in the paper.
In this way, Crossref is creating a database of citations; given a DOI,
this database stores other DOIs that cite it.
Other questions related to citations:
- Why are you not using Google Scholar citations? Because they are
"closed"; Google’s terms of service forbids data collection by bots
or similar programs.
- Why the number of Crossref and Google Scholar citations do not
There are two major reasons:
- First, because Crossref only considers citations coming from documents
with DOIs. Therefore, citations made in books, theses, preprints,
informal or workshop papers, etc are not considered. Interestingly, this
concludes that "Google Scholar unique citations have, on average, a much
lower scientific impact than citations also found by WoS/Scopus".
- The second reason is
because some publishers do not provide citations metadata, when registering
papers with Crossref.
- Who else uses Crossref citations? Users include the Cited-by service
of many journals (Wiley, Elsevier etc), scholarly search engines (e.g.
Digital Science's Dimensions) and
several other scholarly communication analysis tools (see a list in this
How often is your data updated?
We are pushing at least one update per month (including papers,
citations, and arXiv preprints). Latest update dates are informed
at the bottom of the Statistics page.