Haal meer uit gewone zoekmachines Wouter Gerritsma, Vrije Universiteit VOGIN Informatie overload? Verminder je de recall! • • • • • Gebruik minder database(s) (delen) Limiteer op taal Limiteer op publicatiejaar Limiteer op velden Gebruik nabijheidsoperatoren i.p.v. AND • Gebruik specifiekere termen • Voeg een aspect toe Gebruik andere database delen • Google.nl, Google.com, Google.co.uk – books.google.com – scholar.google.com – patents.google.com • Bing.com vs academic.research.microsoft.com Be warned Limiteer op taal of regio • Via de advanced search Limiteer op periode • In Google Scholar Pas op met sort by date • In Google patents • In Google – [“water management” 2000…2010] Zoek in specifieke velden (ti,kw,ab) • [intitle:”water management”] • [allintitle:water management] – Werkt precies zo in Google Scholar • [intext:”water management”] • [allintext:water management] Nabijheidsoperatoren in Google • [“water management” “technology assessment”] • [“water management” AROUND(5) “technology assessment”] – Vooral effectief om relevante lange teksten te vinden – Werkt helaas niet in Google Scholar! Gebruik specifiekere term • ["heart disease" "heart attack"] • ["heart disease" "myocardial infarction"] • Op zoek naar professionele termen? – [chocolate thesaurus|glossary|dictionary] Voeg een aspect toe • ["water management"] • ["water management" "technology assessment"] Boolean en Google: haakjes werken niet! OR gaat voor AND! • a b OR c d • a AND (b OR c) AND d • a OR b c OR d • (a OR b) AND (c OR d) • [library technology OR systems] • [kinderen OR jongens OR meisjes OR tieners OR pubers dik OR zwaarlijvig OR zwaar OR overgewicht ziek OR ziekelijk OR ziekte OR ziektebeeld] Additionele operatoren • site:[bibliotheek site:wageningenur.nl] – [site:*.nasa.* inurl:education] • ext:[crinipellis perniciosa ext:pdf] Specifiek Google Scholar General Google Scholar syntax • • • • • • • Use ["phrase searching"] Search for file type ["land degradation" ext:doc] Search for specific domains ["land ownership" site:.ac.uk] !!! Search in titles [allintitle:anatomy plants water] Search with OR ["carbon dioxide" OR CO2] Exclude terms ["nutrient cycling" –nitrogen] Combine ["carbon dioxide" OR co2 intitle:"nutrient cycles" phosphorus ext:pdf] Pas de settings aan! • Search results – Number of results : 20 max – Open in a new window – Choose the right reference manager • Languages – Also in available in Dutch • Library links – Choose link resolver of your University Searching for recent literature! • Use publication years – In the advanced search – Or use the facets in search result – Sort by date??? • Inclusion of word variants is unclear. (Google Scholar has a smaller dictionary than Google.com) • No verbatim search, but phrases do work! ["behaviour"] • Parentheses () do not work in Google (Scholar)! Search engine result page • Ranking influenced by citations (older articles on top!) Alerts galore • For searching GS strings must be kept under 256 characters. • For alerts GS strings must be kept under 100 characters • My updates : Alerts based on the topics of your publications (different from my Citations) My Citations & My Library • My Citations – Your personal profile page, concentrating on publication lists and citations. – Identifies you co-author network • My library – Your saved articles – Also lists all your references "cited by me" – No plugin for word processors (yet) Google Scholar quality • Coverage is still unclear – Coverage is constantly growing, probably the most comprehensive scholarly literature database (Harzing, 2014. Scientometrics 98 (1): 565-575. http://dx.doi.org/10.1007/s11192-013-0975-y) – Not all OA repositories are (fully) indexed (https://atmire.com/website/?q=content/google-scholar-and-dspace) – Concentrates on scientific articles, share of books is growing, Grey Lit from repositories partially included • For systematic reviews Pubmed alone is insufficient, complementary searches in GS and other indexes are a necessity • Metadata sometimes problematic, the version function under the snippet helps (Bramer, et al. 2013. Systematic Reviews 2 (1): 115. http://dx.doi.org/10.1186/2046-4053-2-115) Google Scholar and Google • Google and Google Scholar are separate indexes! • Google Scholar results are pushed in Google when searching for scientific subjects – Make use of that reminder! – But not the other way around • Some repositories are covered by Google but not in Google Scholar, or partly covered in Google Scholar – Google Scholar [site:edepot.wur.nl] – Google [site:edepot.wur.nl] Google scholar is getting better at metadata • https://scholar.google.com/scholar_lookup?doi=10.1126%2Fscience.1152509 &issn=0036-8075&issue=5857&journal=SCIENCE&hl=en&pages=17371742&pmid=18079392&title=Coral%20reefs%20under%20rapid%20climate% 20change%20and%20ocean%20acidification&author=O%20HoeghGuldberg&publication_year=2007&volume=318 • https://scholar.google.com/scholar_lookup?doi=10.1126/science.1152509 • https://scholar.google.com/scholar_lookup?issn=00368075&issue=5857&pages=1737-1742&author=HoeghGuldberg&publication_year=2007&volume=318 Bronnenkennis Wouter Gerritsma, Vrije Universiteit VOGIN Wetenschappelijk zoekmachines • Google Scholar • Microsoft Academic Search • DeepDyve • Defunct: SciRus Bibliografiën/metadazoekmachines • • • • PubMed http://pubmed.gov/ Eric http://www.eric.ed.gov/ LISTA http://www.libraryresearch.com/ WorldWideScience http://worldwidescience.org/ Boeken Catalogi • Bibliotheek catalogi bijv: http://library.wur.nl/WebQuery/clc/ • Open worldcat http://www.worldcat.org/ • Hathi Trust http://www.hathitrust.org/ • DOAB: http://www.doabooks.org/doab Fulltext boeken zoekers • Google Booksearch http://books.google.com/ • Hathi Trust http://www.hathitrust.org/ E-books aggregators • • • • • Proj Gutenberg http://www.gutenberg.org/wiki/Main_Page Online Books Page http://onlinebooks.library.upenn.edu/ Internet Archive texts http://www.archive.org/details/texts Gallica, Bibliothèque Numérique http://gallica.bnf.fr/ Freebooks for doctors http://www.freebooks4doctors.com/ Open Access Journals • • • • • • DOAJ http://www.doaj.org/ Open J-Gate http://www.openjgate.org/ LivRe! http://livre.cnen.gov.br/Default2I.asp PubMed Central http://www.pubmedcentral.nih.gov/ Highwire Press http://highwire.stanford.edu/ Elektronische Zeitschriftenbibliothek EZB http://rzblx1.uniregensburg.de/ezeit/index.phtml?bibid=AAAAA&colors=7 OA repositories OAI metadata zoekmachines • Narcis http://www.narcis.info/index • BASE http://www.base-search.net/ • OAISTER: http://oaister.worldcat.org/ • ArXiv http://arxiv.org/ • Driver http://www.driver-repository.eu/ Directories of repositories • ROAR http://roar.eprints.org/ • openDOAR http://www.opendoar.org/ Research data Research data repositories NL • DANS https://easy.dans.knaw.nl/ui/home • 3TU.Datacentum http://datacentrum.3tu.nl/ • Data portal • Datacite http://search.datacite.org/ui • Narcis http://www.narcis.nl/search/coll/dataset/Language/en searches DANS, 3TU.Datacentrum, the Language Archive and datasets from Tilburg University, Wageningen University, MPI, DANS-KNAW and CentERdata • Directories of research data repositories • Re3Data (in opbouw) http://service.re3data.org/search/ • OAD http://oad.simmons.edu/oadwiki/Data_repositories Grey Literature • • • • Greynet http://www.greynet.org/ Grijze Literatuur in Nederland (GLIN) http://picarta.pica.nl/xslt/DB=3.2/ GreySource http://www.greynet.org/greysourceindex.html OpenGrey http://www.opengrey.eu/ Dissertaties en Proefschriften • NDLTD http://www.ndltd.org/find (met verschillende suggesties) • DART Europe http://www.dart-europe.eu/basic-search.php • Narcis http://www.narcis.nl/search/coll/publication/genre/doctoralthes is/Language/en Patenten • • • • • • • • Patenten databases USPTO http://www.uspto.gov/patft/index.html Espace http://ep.espacenet.com/ WIPO http://www.wipo.int/pctdb/en/search-adv.jsp Patenten zoekmachines Google patent Search http://patents.google.com Cambia patent lens http://www.patentlens.net/ (life science patenten) Databases • • • • Genbank http://www.ncbi.nlm.nih.gov/Genbank/index.html PubChem http://pubchem.ncbi.nlm.nih.gov/ UniProt http://www.ebi.ac.uk/uniprot/index.html Database issue van NAR database overzicht. Wetenschappelijk nieuws • • • • • Eureka Alert http://www.eurekalert.org/index.php News@Nature http://www.nature.com/news/index.html Science Daily http://www.sciencedaily.com/ SciCentral http://www.scicentral.com/ Scientific American http://www.sciam.com/news_directory.cfm • New Scientist http://www.newscientist.com/news.ns • Noorderlicht http://noorderlicht.vpro.nl/ Source: http://deepwebtechblog.com/the-deep-web-is-not-all-dark/ Oorzaken van het diepe Web • • • • • De informatie zit in databases Zoekmachine limiteringen Website limiteringen Cognitieve factoren Web 2.0 Het databases probleem • Inhoud van databasese moeilijk te indexeren – Zoekmachines zien een invulscherm – Zoekmachines zien een “zoek” knop • Database inhoud niet geoptimaliseerd voor WWW, records ranken lag • Dit speelt ook met onze catalogi en repositories (Arlitsch, K., & O'Brien, P. S. (2012). Invisible institutional repositories: addressing the low indexing ratios of IRs in Google. Library Hi Tech, 30(1), 60-81. http://dx.doi.org/10.1108/07378831211213210) Zoekmachinelimiteringen Bron: http://drunkmenworkhere.org/219 (2006) Websitelimiteringen • • • • Robots.txt op de root directory Sitemap.xml voor grote sites Gebruik van javascript File formats zoals zip, flash worden niet/nauwelijks geindexeerd • Tekst files zijn niet ge-ocr’d Cognitieve factoren • Mensen kijken niet verder dan hun neus langs is • Bladeren niet voorbij de standaard 10 resultaten op 1e SERP Web 2.0 • Social media has exploded since 2005 • Large social media sites (Facebook, Instagram are closed/partially closed platforms) Oplossingen voor databases • Zoek databases waar je ze kunt verwachten – CBS verzamelt statistieken en heft Statline, CBS in uw buurt • Zoek naar databases over het onderwerp met als additionele termen woorden die naar databases verwijzen zoals: database, data, dataset, archive, bibliography, index, directory, register, zoek, search of statistics • ["plane crash" | "aviation accidents" database].