1 / 37

CSCD506 Research Methods

CSCD506 Research Methods. Lecture 3 CS Body of Literature Fall 2018. Managing CS Literature. There are too many conferences, workshops, symposium one has to keep track of At least in most areas Same for journals, newsletters, bulletins etc. There is a huge amount of CS literature

seguinj
Download Presentation

CSCD506 Research Methods

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. CSCD506 Research Methods Lecture 3 CS Body of Literature Fall 2018

  2. Managing CS Literature There are too many conferences, workshops, symposium one has to keep track of At least in most areas Same for journals, newsletters, bulletins etc. There is a huge amount of CS literature Including theses, online books, technical reports... Agree?

  3. Managing CS Literature Solution Make extensive use of existing literature search tools Assuming that you know what your research focus is... Started this process … with EWU library

  4. More CS Collections Computer Science Bibliography Server (DBLP)‏ We didn't talk about this one, next slides Other sources ACM Portal IEEE Xplore Google Scholar Science Direct CiteSeer Networked Computer Science Technical Reference Library (NCSTRL)‏ Collection of Computer Science Bibliographies, next slides

  5. http://dblp.uni-trier.de

  6. Dblp Computer ScienceBibliography What is DBLP? The dblp computer science bibliography is the on-line reference for bibliographic information on major computer science publications. It evolved from an early small experimental web server to a popular open-data service for the computer science community. Mission is to support computer science researchers in their daily efforts by providing free access to high-quality bibliographic meta-data and links to the electronic editions of publications As of May 2016, dblp indexes over 3.3 million publications, published by more than 1.7 million authors. dblp indexes about than 32,000 journal volumes, more than 31,000 conference or workshop proceedings, and more than 23,000 monographs.

  7. Computer Science Bibliographies http://liinwww.ira.uka.de/bibliography/

  8. Computer Science Bibliographies What are the Computer Science Bibliographies? This is a collection of bibliographies of scientific literature in computer science from various sources, covering most aspects of computer science. The bibliographies are updated weekly from their original locations Currently contains more than 7 million references (mostly to journal articles, conference papers and technical reports), clustered in about 1500 bibliographies, and consists of more than 2.3 GBytes (530MB gzipped) of BibTeX entries. More than 600 000 references contain cross-references to citing or cited publications. More than 1 million references contain URLs to an online version of the paper. Abstracts are available for more than 800 000 entries. There are more than 2000 links to other sites carrying bibliographic information.

  9. Microsoft Academic http://academic.research.microsoft.com/ Help explanation page: https://academic.microsoft.com/#/faq

  10. Microsoft Academic How is MA different from other academic search engines? MA is a semantic search engine, not a keyword-based one. Traditional search engines rely mostly on keyword matching. Usually, they match the keywords you type in the search field with words found in the indexed content. Accuracy of the search results depends on the quality of the keywords you type, which puts the responsibility of a successful search on the user MA is different because it employs natural language processing to understand and remember the knowledge conveyed in each document. MA then applies a technique known as semantic inference to recognize user's intent and to proactively deliver results relevant to the user's intention. As a result, MA can process complex queries and can provide rich and knowledgeable answers.

  11. Citeseer http://citeseer.ist.psu.edu/

  12. Literature Searches • Useful one is an entire journal of Surveys • Survey/Review Papers • ACM Computing Surveys Journal http://dl.acm.org/citation.cfm?id=J204 • Wikipedia Overview of ACM Computing Surveys http://en.wikipedia.org/wiki/ACM_Computing_Surveys

  13. Assessing Relevance Journals What are the leading journals in your area ? Are papers of sufficiently high quality ? Are members of editorial board active researchers ? Does the journal have a reasonable impact factor? Is it refereed? Conferences Can you identify leading conferences covering your area? What are the acceptance rates? What is the quality of the papers accepted?

  14. Impact Factor The impact factor, IF What is the Impact Factor? Measure reflecting average number of citations to articles published in science and social science journals Frequently used as a proxy for relative importance of journal within its field, Journals with higher impact factors deemed more important than those with lower ones Impact factors calculated yearly for those journals indexed in Thomson Reuter's Journal Citation Reports

  15. Impact Factor Impact factor was devised by Eugene Garfield, Founder of the Institute for Scientific Information (ISI), now part of Thomson Reuters By indexing scholarly work by citation, Garfield allowed researchers to track What other works a paper has referenced, and How many times others have cited a paper By counting citations, “Impact Factor” could be measured, assigning an indicator of quality to more influential works Article on Impact of Impact Factor https://clarivate.com/essays/history-citation-indexing/

  16. Impact Factor “Even if there were no other use for a citation index than that of minimizing the citation of poor data, the index would be well worth the effort required to compile it,” Garfield wrote in his 1955 paper “Citations are an acknowledgment of intellectual debt. For example, Web of Science lets researchers instantly recognize works that are well regarded by their peers That way, they know they are basing their work on quality research.”

  17. Calculating Impact Factor WHAT IS AN IMPACT FACTOR? The journal impact factor is a measure of the frequency with which the average article in a journal has been cited in a particular year. The impact factor helps you evaluate a journal's relative importance, especially when you compare it to others in the same field. The impact factor is calculated by dividing the number of citations in the current year to articles published in the two previous years by the total number of articles published in the two previous years. Using Journal X as an example: Citations in 2007 to articles published in: 2006 1266 2005 1643 sum 2909 Number of articles published in: 2006 352 2005 364 Sum 716 Calculation: Total cites to articles published in 2005 and 2006 2909 = 4.063 Number of articles 716 published in 2005 and 2006 The 2007 Impact Factor for Journal X is 4.063 Calculating Impact Factor Calculating Impact Factor

  18. Assessing Author Relevance Authors Identify leading individual researchers/authors working in your area In what journals do they regularly publish? Who is the most cited author(s)? Who publishes the most? Who produces students? Where are their students employed?

  19. Identifying Notable Authors Not as easy as a journal citation index Look at number of times author cited Also, the quality of conferences Body of work from this author His/Her graduate students His/Her institution, prestigious or not All of the above will provide some clues as to the importance of the author in his/her field Also something called h-index and g-index Library Explains h-index, g-index and i-10index Barbara Liskov of MIT

  20. Indexes for Authors h-index = h has at least h papers that have been cited h times. h-index is an author-level metric that attempts to measure both productivity and citation impact of publications of a scientist or scholar For example, a researcher with an h-index of 22 has 22 papers that have been cited at least 22 times. http://theconversation.com/explainer-what-is-an-h-index-and-how-is-it-calculated-41162 g-index = [Given a set of articles] ranked in decreasing order of the number of citations that they received, g-index is the (unique) largest number such that the top g articles received (together) at least g² citations." A g-index of 20 means that and academic has published at least 20 articles that combined have received at least 400 citations. https://harzing.com/resources/publish-or-perish/tutorial/metrics/h-and-g-index i10-index = metric used by Google Scholar, is the number of publications with at least 10 citations for all of the citations listed in your profile.

  21. Conferences vs. Journals

  22. Conferences vs. Journals in CS Which are Better? Conferences are Better? Conferences have higher status It is self-perpetuating since best researchers want to send their papers to conferences rather than journals Conferences provide higher visibility and greater impact Many people will attend your talk, you will have the opportunity to answer questions, and people will talk to both you and to one another in the hallways Even disregarding the event itself, more non-attendees read conference proceedings than read journals

  23. Conferences vs. Journals Conferences have higher quality Acceptance rates to good conferences are often around 10%, at least in software engineering .. whereas even best journals are less selective There does exist low-quality conferences (and journals), but if your c.v. is cluttered with them, then you will appear to be incapable of good work (even if the work you published in those venues really is good!), and your good publications will not stand out A good rule of thumb is that the best conferences are sponsored by ACM or IEEE

  24. Conferences vs. Journals Conferences are more timely It can take years for a journal publication to appear (or even for reviews to come back), Whereas turnaround time for conference reviews is a few months, and proceedings also appear quickly Conferences have higher standards of novelty Journals often only require 20-30% of the material to be new, compared to an earlier conference version

  25. Conferences vs. Journals Why journals are better Journals may have longer page limits If you have too many experimental results to fit in a conference publication, then a journal affords an opportunity to include them You can also include proofs that are too long (or boring) for a shorter publication. Journal reviews tend to be more detailed A journal reviewer may spend days on a paper, whereas a conference reviewer cannot afford to do so for each of the many papers he or she is assigned

  26. The Case for Journals Journals have higher acceptance rates More chance to get your research published Same is true of workshops Good venues for people who are just starting their research careers Some lesser-ranked universities evaluate faculty on basis of journal publications Because Dean of Engineering is unable or unwilling to understand computer science A top-ranked CS department can convince the dean to use the proper evaluation metric

  27. Case for Journals Best papers at a conference are often solicited for expedited journal publication Whether you accept this invitation should be based on factors above, such as whether there is value to the community of an expanded version of the paper, and how much more work it is to prepare the journal version http://www.cs.washington.edu/homes/mernst/advice/ conferences-vs-journals.html

  28. A Study that Confirms This As stated previously Published yearly by Thomson ISI Journal Citation Report (JCR) by counts of the citations from articles of thousands of journals Research results in computer science are often published in high-quality conferences which are not covered by the JCR citation databases List of Impact Factors http://www.sciencegateway.org/rank/index.html

  29. Conferences Rule Study of Conferences vs Journals http://iospress.metapress.com/content/v3343450241483q7/ The majority of papers appeared in conferences and workshops, not in journals Furthermore, total number of citations is higher for conferences and workshops than for journals

  30. Conferences Rule Authors used cleaned citation data from GoogleScholar for an in-depth citation analysis for database research Analyzed all publications over a period of 10 years (1994–2003) which appeared in top Database conferences and top Database journals Two top conferences (ACM Sigmod, VLDB) not only publish many more papers than the top journals (ACM TODS, VLDB Journal) But that they receive many more citations in total and per paper

  31. Conferences Rule

  32. Citation from Study Comparing the scientific impact of conference and journal publications in computer science By Erhard Rahm http://iospress.metapress.com/content/v3343450241483q7/

  33. How to find Conferences and Journals for your Specialty ACM http://www.acm.org/conferences IEEE - Communications Society http://www.comsoc.org/conferences/conferencesearch IEEE Computer Society http://www.computer.org/portal/web/conferences/calendar Computer Security http://www.ieee-security.org/Calendar/cipher-hypercalendar.html

  34. Journals and Magazines IEEE Computer Society Publications http://www.computer.org/portal/web/csdl/home ACM Digital Libaray http://portal.acm.org/

  35. Summary Conferences vs. Journals • Conferences • More impact in the CS field • Many more conferences than journals • More current results, more papers • Journals • Longer time to publication • More thorough review process • Longer, more thorough papers (in general)‏

  36. End New Assignment See the Assignments page On trying out Databases of articles

More Related