[SCC_Active_Members] RE: SCC_active Digest, Vol 9, Issue 1

Mike Walton walton at computerhistory.org
Thu Dec 2 09:04:31 PST 2004


Wow, this is a significant announcement.

We have taken the approach that we need to keep an "indexable" portion
of our repository available, with more metadata in filenames and
directories, so that bots like Google's would find content.

The indexing of DSpace repositories lessens the need to try to maintain
data "outside" of the main archive databases, since bots can
potentially spider the metadata directly within the repository.
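
To illustrate the crawler-access point, here is a minimal sketch,
assuming Python and its standard urllib.robotparser module, of checking
whether a repository's robots.txt would let a bot fetch an item page.
The robots.txt content, the host name repository.example.org and the
paths are hypothetical and not taken from any real DSpace installation.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for an imaginary repository host.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""


def crawler_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for path in ("/handle/1234/56", "/private/report.pdf"):
        url = "http://repository.example.org" + path
        print(path, crawler_allowed(SAMPLE_ROBOTS_TXT, "Googlebot", url))
```

This only covers robots.txt rules; a web server can also block bots in
other ways (for example by IP address or login requirement), which this
check would not detect.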

I've kept an eye on DSpace and will be evaluating it for just the kinds
of application Lee calls for: software repository, archive, research
library.  Excellent news also that DSpace's metadata is based on Dublin
Core.

Obviously it's a larger decision to adopt this open source tool globally
for the museum collections and archive, but it's worthy of scrutiny.

--
Mike Walton           Director of Information Technology
Computer History Museum          650.810.1040 (1055-fax)
Walton at computerhistory.org         1401 N Shoreline Blvd
http://www.computerhistory.org/        Mt View, CA 94043  




> -----Original Message-----
> From: scc_active-request at computerhistory.org 
> [mailto:scc_active-request at computerhistory.org] 
> Sent: Wednesday, December 01, 2004 12:00 PM
> To: scc_active at computerhistory.org
> Subject: SCC_active Digest, Vol 9, Issue 1
> 
> Send SCC_active mailing list submissions to
> 	scc_active at computerhistory.org
> 
> To subscribe or unsubscribe via the World Wide Web, visit
> 	http://mail.computerhistory.org/mailman/listinfo/scc_active
> or, via email, send a message with subject or body 'help' to
> 	scc_active-request at computerhistory.org
> 
> You can reach the person managing the list at
> 	scc_active-owner at computerhistory.org
> 
> When replying, please edit your Subject line so it is more 
> specific than "Re: Contents of SCC_active digest..."
> 
> 
> Today's Topics:
> 
>    1. 	RE: [Dspace-general] Google Scholar inclusion of DSpace
>       repositories (Lee Courtney)
> 
> 
> ----------------------------------------------------------------------
> 
> Message: 1
> Date: Tue, 30 Nov 2004 14:34:48 -0800
> From: "Lee Courtney" <lcourtney at mvista.com>
> Subject: [SCC_Active_Members] RE: [Dspace-general] Google Scholar
> 	inclusion of DSpace repositories
> To: <scc_active at computerhistory.org>
> Cc: MacKenzie Smith <kenzie at mit.edu>
> Message-ID: <CKEHKGBEPGNFIIBBGIJAAEBDNIAA.lcourtney at mvista.com>
> Content-Type: text/plain;	charset="US-ASCII"
> 
> Dear fellow Software Collection Committee members:
> 
> Very interesting tidbit from the D-Space General mailing list:
> 
> > I wanted to mention that the new Google Scholar search
> > (http://scholar.google.com) is including items from DSpace 
> > repositories in the results, as long as they're open for harvesting 
> > the full-text.
> 
> This has *very* interesting implications for the Historic Software
> Archive. If we base our repository infrastructure on a tool that
> facilitates this external indexing and search inclusion, then it
> appears content will get included in searches by Google et al.
> 
> At a minimum this is a requirement for whatever software we end up
> with to house artifacts in the Software Collection.
> 
> Thoughts?
> 
> Cheers,
> 
> Lee Courtney
> 
> MontaVista Software
> 1237 East Arques Avenue
> Sunnyvale, California 94085
> (408) 328-9238	voice
> (408) 328-9204	fax
> 
> > -----Original Message-----
> > From: dspace-general-bounces at mit.edu
> > [mailto:dspace-general-bounces at mit.edu] On Behalf Of MacKenzie Smith
> > Sent: Sunday, November 28, 2004 12:42 PM
> > To: dspace-general at mit.edu
> > Subject: [Dspace-general] Google Scholar inclusion of DSpace 
> > repositories
> >
> >
> > Hi all,
> >
> > I wanted to mention that the new Google Scholar search
> > (http://scholar.google.com) is including items from DSpace 
> > repositories in the results, as long as they're open for harvesting 
> > the full-text. I did notice that some institutions running DSpace
> > that should be there aren't yet, so I've asked Google why they're
> > missing.
> >
> > It can be a little tricky to figure out if your institution is
> > getting included or not -- search some known items from your
> > repository and plow through all the results, and be sure to check
> > all the versions since your copy might not be one of the first
> > listed. If you're there, great, and if you're not (and want to be)
> > then first make sure your repository's web server isn't blocking
> > crawlers, and then write to me or them directly
> > (scholar-support at google.com) to make sure they crawl your site.
> >
> > They also wanted me to mention that if you have limited access
> > material that you would like to get indexed by Google but not
> > cached by them for display, they're very interested in working with
> > you. For example, at MIT we have some book titles from the MIT
> > Press in our DSpace repository which are only available for free to
> > the MIT community. Google proposes to index them, but not cache
> > them, so that when a searcher finds one of them in a result set in
> > google.com they're returned to DSpace to view the item and can get
> > to the Press's online ordering system from there. More traffic for
> > the book, more money for the Press. Let me know if you're
> > interested in this and I'll put you in touch with the Google folks.
> > Remember: if your DSpace content is freely available to the public
> > then Google and the other web search engines should *already* be
> > harvesting it so you don't need to do anything...
> >
> > MacKenzie
> >
> >
> > MacKenzie Smith
> > Associate Director for Technology
> > MIT Libraries
> > Building E25-131d
> > 77 Massachusetts Avenue
> > Cambridge, MA  02139
> > (617)253-8184
> > kenzie at mit.edu
> >
> > _______________________________________________
> > Dspace-general mailing list
> > Dspace-general at mit.edu
> > http://mailman.mit.edu/mailman/listinfo/dspace-general
> 
> 
> ------------------------------
> 
> _______________________________________________
> SCC_active mailing list
> SCC_active at computerhistory.org
> http://mail.computerhistory.org/mailman/listinfo/scc_active
> 
> 
> End of SCC_active Digest, Vol 9, Issue 1
> ****************************************
> 
> 


