Saturday, February 7. 2009
SUMMARY: In “Authors: I don’t care where you deposit, just do it”, Gavin Baker suggests that it does not matter where authors deposit their papers to make them Open Access (OA): in an Institutional Repository (IR) or a Central Repository (CR). Nor, more importantly, does it matter where authors' funders mandate that they should deposit them, because IR deposits can be harvested to CRs and vice versa. I point out that this apparent symmetry between IRs and CRs with respect to the harvestability from one to the other (in either direction) is irrelevant today because most of the target content for OA is not yet even being deposited at all, anywhere: In other words, authors are decidedly not "just doing it." Nor are institutions -- the universal providers of OA's target output, both funded and unfunded, across all fields -- "just mandating that authors do it." Apart from the tiny number (about 30) that have already mandated deposit, institutions are the "slumbering giant" of OA, until they wake up and mandate the deposit of their own research output in their own IRs. Not all research output is funded, but all research output is institutional: Hence institutions are the universal providers of all OA's target content. Although not many funders mandate deposit either, the few that already do (about 30) can help wake the slumbering giant, because one funder mandate impinges on the research output of fundees at many different institutions. But there is a fundamental underlying asymmetry governing where funders should mandate deposit: As Prof. Bernard Rentier (founder of EuropenScholar and Rector of U. Liège, one of the first universities to adopt an institutional deposit mandate) has recently stressed, convergent funder mandates that require deposit in the fundee's own IR will facilitate the adoption of deposit mandates by institutions (the slumbering giant), whereas divergent funder mandates that require CR deposit (or are indifferent between CR and IR deposit) will only capture the research they fund, while needlessly handicapping (or missing the opportunity to facilitate) efforts to get institutional deposit mandates adopted and complied with too. The optimal solution for both institutions and funders is therefore: "Deposit institutionally, harvest centrally".
The well-meaning recommendations of Gavin Baker (GB) in “Authors: I don’t care where you deposit, just do it” are based on some implicit misunderstandings and unrealistic assumptions about the status quo today (OA < 15%) and what is practically required to get us from that status quo to 100% OA, at long last.
Locus-of-Deposit Matters for OA Mandates. Getting us to 100% OA at long last is precisely what OA deposit mandates are all about. And that in turn is what makes locus-of-deposit the crucial matter that Prof. Rentier has fully recognized it to be: The lesson for GB is that if you care about getting authors to deposit, you need to care about where you mandate deposit!
Symmetry of IR/CR InterHarvestability Is Irrelevant Without Deposits, and Deposit Mandates. GB makes the simple, indeed simplistic assumption of symmetry, insofar as harvestability is concerned: There are Institutional Repositories (IRs) and Central Repositories (CRs) and they are all interoperable, hence inter-harvestable: “What difference does it make where the OA content is deposited? It can be harvested in either direction!” The difference is very simple: Most of it is not deposited anywhere, and what we are talking about is choosing the fastest and surest policy to get it all deposited (and hence OA).
Next to No Deposits Without Deposit Mandates. Everyone (including GB) is at last coming to realize that the way to get OA's target content to be deposited and thereby made OA is by adopting deposit mandates, because most researchers (85%) are simply not depositing, whereas most researchers say they will deposit, willingly, if mandated; and it turns out that when it is indeed mandated, most researchers actually do deposit.
Who Should Mandate Deposit, Where? The apparent symmetry of cross-harvestability to/from IRs and CRs (once everything is indeed deposited) certainly does not answer these two questions. A closer examination of the question of who is to mandate deposit, and where discloses profound and interrelated asymmetries. The two candidates for mandating deposit are authors' employers (i.e., their institutions) and authors' funders. And the two candidate loci for deposit are the authors' own institutions (IRs) or various thematic or geographic collections (CRs).
Not All Research Is Funded, But All Research Is Institutional. There are tens of thousands of research-active universities, research institutions, and major R&D organizations worldwide. These are the "institutions" and they generate virtually all of OA's primary target content: refereed journal articles. There are also a few thousand major research funders, public and private, that fund some (but by no means all) research output.
Asymmetry I: Funder Mandates Only Cover Part of OA’s Target Content. And here we have the first profound asymmetry that GB overlooks: All of OA's target output is produced by institutions (unaffiliated researchers represent such a minute fraction of the total that it would be absurd to base overall OA policy on this tiny special case). But not all -- in fact perhaps not even most -- of OA's target output is funded. So if CRs are to be filled by funder mandates, they will not be filled with all and perhaps not even most of OA's target content even if all funders were to mandate it -- and, crucially, most funders still don't mandate deposit at all.
The “slumbering Giant”: Most Institutions Do Not Yet Mandate Deposit. The fact that most institutions don’t mandate is the practical gist of the entire locus-of-deposit issue: How does locus-of-deposit affect probability-of-mandate adoption (and compliance)? Without the mandates (and mandate compliance), we do not have the OA.
Funder Mandates Can Wake the slumbering Giant – If They Stipulate Institutional IRs As Locus-of-Deposit. Now the coverage-asymmetry we have already pointed out -- that all research originates from institutions but not all research is funded -- already suggests that there may not be parity between IRs and CRs: If funders were the only ones to mandate deposit, and they either stipulated that papers had to be deposited in a CR, or they left it open-ended whether they were deposited in a CR or an IR, then that would leave all unfunded (institutional!) content undeposited, and no form of CR-to-IR harvesting could fill the IRs with their missing content. This would remain true even if (mirabile dictu) all funders were to mandate deposit. It is the institution/funder, IR/CR coverage asymmetry, which is irremediable by harvesting.
Institutions Are the Universal OA Providers. Now note that the reverse is not true, and this is another aspect of the coverage-asymmetry: If (mirabile dictu) all institutions were to mandate deposit, then that would indeed generate 100% OA, and funders and CRs would indeed be able to harvest all their target content from the CRs. The trouble is, and remains, that all institutions do not yet mandate deposit; only a tiny number of them (about 30) do today.
Funder Mandates Reach and Bind Multiple Distributed Institutions. It is likewise true that only a tiny number of funders (also about 30) as yet mandate deposit. (Perfect symmetry so far.) But because the number of funders is far smaller than the number of institutions, and because some of the mandating funders are rather big ones, the actual proportion of mandating funders (as well as the amount and proportion of OA content that they cover) is substantially higher than the current proportion of mandating institutions (and the amount and proportion of OA content that they cover).
Asymmetry II: Funder Mandates Can Jump-Start Institutional Mandates – If Locus-of-Deposit Is Convergent Rather than Divergent. This is the second IR/CR asymmetry, and the most important one, for with a simple change in the stipulated locus-of-deposit of these few funder mandates (and future ones) -- from CR or IR/CR deposit to IR deposit -- funder mandates can, at no loss to themselves, help to jump-start a burst of IRs and IR mandates from the “slumbering giant,” the distributed worldwide set of institutions: the universal providers of all OA content. Every single funder mandate in fact touches on many institutions: the institutions of each one of its fundees. If funders mandate CR deposit only, or even if they leave it open whether deposit is in a CR or an IR, they miss the opportunity to systematically motivate and facilitate IR-creation and mandate-adoption by institutions. Suppose funders instead stipulate the fundee's own IR as the designated locus of deposit (allowing, where needed, the option of depositing in interim back-up repositories like DEPOT, [currently languishing with only 66 deposits!], explicitly designed to host deposits temporarily for institutions that have not yet set up IRs of their own, ready to be exported to the depositor's institution automatically as soon as they have set up their own IR):
Funders Motivate Institutions to Mandate and Authors to Comply. The result of funder mandates designation the fundee's IR as locus of deposit is that funders greatly increase the potential scope and power of their deposit mandates, far beyond only their own funded output: With institutional authors depositing funder-mandated papers in their own IRs, not only are authors thereby motivated to deposit the rest of their output in their IRs too, but their institution is motivated to capture the rest of its institutional output by mandating deposit too. (The slumbering giant awakes!)
Collaboration and Convergence Versus Competition and Divergence Between Institutional and Funder Mandates. If funder mandates thus collaborate and converge on locus-of-deposit with institutional mandates, by stipulating IR deposit (at the unitary local source: the universal provider), instead of competing and diverging needlessly on locus of deposit, nothing whatsoever is lost and everything is gained. Automatic harvesting is indeed there, ready to collect funder-mandated IR deposits into the funder's preferred CRs where desired, because the deposits themselves have been done: Institutions are well placed and only too willing to cooperate in monitoring and ensuring compliance with the grant fulfillment conditions of the funders of their researchers. And the more that institutions are involved in seeing to it that their researchers do their depositing in compliance with funder requirements, the sooner the token will fall for them, so they realize that the very same solution can and should be applied to all their institutional research output, funded and unfunded, in all areas, by adopting an institutional mandate too.
Institution/Funder Complementarity and Synergy in Reaching Universal OA. It is from the fundamental asymmetry underlying the question of locus-of-deposit and direction of harvesting that the institution/funder complementarity and synergy emerges: It has next to nothing to do with the symmetry of harvestability, given the deposits, and everything to do with ensuring that the deposits actually get done! (And this is without even mentioning the third asymmetry underlying the absurdity of institutions having to back-harvest their own institutional output because it has been deposited willy-nilly institution-externally instead of being deposited institutionally in the first place, and then automatically exported wherever else desired.)
I will reply to the rest of GB's posting via quote/commentary:
If requiring deposit is left to universities, their repositories will only contain publications by their researchers.
(1) But the total of all institutional research output is the total of all research output, hence 100% of OA’s target content. (“Institutions” includes all universities, research institutes, and major R&D organizations.)
(2) Not so for the total of all funded output.
(3) And the fundamental problem for OA today is that the institutions – the slumbering giant, the universal providers – do not yet require deposit. Only a tiny number of them do.
(4) If (as is simply assumed here) all institutions did mandate deposit for all of their research article output, we would have 100% OA.
(5) The issue, therefore, is: How can locus-of-deposit be used to help induce all institutions to mandate deposit?
Since some researchers have multiple institutional affiliations, and since any given publication may be authored by researchers across multiple institutions, it is easy to see that researchers will ultimately have to deposit their publications in as many repositories as there are institutions involved in their research.
We are talking about one, single locus of direct deposit, particularly for funder-mandated deposits, and whether that one deposit should be in the author’s own IR or a non-institutional CR, if the purpose is not just to deposit that one paper to make it OA, but to wake the slumbering giant to mandate depositing all institutional output to make it OA.
For an author affiliated with multiple institutions, that still only means a one-time IR deposit, in the IR of one of his institutions, followed by export to the IRs of the rest of his institutions. That has nothing to do with the question of symmetry for the locus of deposit of funder mandates that cover multiple institutions.
This trend seems to rest on the naive notion that, in the Internet era, it is somehow still necessary for researchers to conduct their work solely through the channels of a university.
It is not clear what futuristic scenario GB may have in mind, but the reality today is that researchers are employed and paid by their institutions to conduct and report their (funded and unfunded) research.
It is understandable that universities may wish to host a complete collection of the research published by their faculty, but nowadays that can easily be accomplished by importing it automatically from the more complete collections of the distributed Web.
Yes, just as manna from heaven could be harvested to feed the hungry! The trouble is, most research (85%) is not yet deposited anywhere on the distributed Web today, hence nowhere to be harvested to or from, because deposit has not yet been mandated. We are talking here about how locus-of-deposit, particularly for funder mandates, can increase the probability of waking the slumbering giant (institutions) to mandate deposit of all of their output, funded and unfunded, across all disciplines, institutions, languages and nations. That way we might one day indeed have a “complete collection” of OA research output globally.
Recall also that universities are not the universal providers of all research output. There will always be independent scholars, as well as publications by authors in government, non-profits and think tanks, and corporations.
Government, non-profits, think-tanks and corporations are all “institutions,” can all create their own (individual or consortial) IRs, and can mandate deposit in them. And funders can mandate deposit in those IRs by those of their fundees working at those institutions.
The OA philosophy is global. It cannot be reduced to a single university.
No one is reducing OA to a single university. OA content is distributed across a global network of institutions, the universal providers. And the idea is not reduction but expansion – of the practice and policy of depositing the research output of all those institutions in their own IRs, through collaborative, convergent rather than competing and divergent deposit mandates from both institutions and funders.
Institutions: Require that researchers ensure their publication ends up in your IR, wherever they initially deposited it. If they are accustomed to depositing in a funder or thematic repository, they’re responsible for working with the IR manager to ensure their work gets harvested. Better still if the IR allows users to set up automatic harvesting themselves, either for a single publication or for all the author’s future publications, if they don’t want to talk to the IR manager.
All splendid alternatives for institutions that already mandate deposit! But the trouble is that the giant is still slumbering, and we are talking about how to wake him. If he were already awake -- i.e., if all or most or even many institutions were already mandating deposit for all of their research output -- then locus-of-deposit would be far less crucial, and GB’s symmetry assumptions and alternatives would be perfectly fine.
The trouble is that that is not at all where we are today. The giant is fast asleep; indeed he has grotesquely overslept! And this panorama of alternative ways to do what he is not doing, nor thinking of doing, is not going to wake him up and induce him to do it.
Whereas convergent IR mandates from funders have a good chance of serving as the wake-up call...
Funders: Require that grantees ensure their publication ends up in your repository, wherever they initially deposited it. If they are accustomed to depositing in an institutional repository, they’re responsible for working with your repository manager to ensure their work gets harvested. Better still if the repository allows users to set up automatic harvesting themselves, either for a single publication or for all the author’s future publications, if they don’t want to talk to the repository manager.
In the current status quo, when only a tiny proportion of funders is mandating deposit at all, and an even tinier proportion of institutions is doing so, this advice is perhaps a tad premature... for existing IR content.
But it would certainly be a good idea to couple IR deposit mandates from funders with clear instructions on how the deposits may be automatically exported to the funder’s preferred CR (if any), as well as how compliance with the deposit mandate is to be verified as a fulfillment condition of the grant, with the collaboration of the fundee’s institution, IR software (for a funder-name metadata tag) and IR management.
This seems more flexible and accommodating to all preferences. In particular, it’s superior to the suggestion that funders should require authors to deposit in their institution’s repository, because  not every institution has an IR. Indeed,  not every funded researcher may even have an institution. The ultimate goal is opening all research, regardless of where the authors work or who funded the research.
 Every institution is just (a) a piece of free software, (b) a $1000 linux server plus (c) a few days of sysad set-up time from having its own IR. In the meanwhile, there are interim alternatives like the DEPOT or multi-institution consortial IR solutions like the Whiterose Consortium.
 Unaffiliated researchers are a red herring, when we are talking about a collaborative policy that will greatly enhance the probability of mandating deposit for the overwhelming majority of OA’s target output that is produced by researchers affiliated with an institution. Moreover, universities have a long and admirable tradition of hosting the work of unaffiliated scholars (and some IRs already do).
In summary: The symmetry between Institutional Repositories (IRs) and Central Repositories (CRs) with respect to harvestability the harvestability from one to the other (in either direction) is irrelevant today because most of the target content for Open Access (OA) is not yet being deposited at all. Convergent funder mandates that require IR deposit will facilitate the adoption of deposit mandates by institutions (the universal research providers), whereas divergent funder mandates that require CR deposit (or are indifferent between CR and IR deposit) will needlessly handicap (or fail to facilitate) efforts to get institutional deposit mandates adopted and complied with. The universal, synergistic solution for both institutions and funders is: Deposit institutionally, harvest centrally.
Gavin Baker comments:
"Every research funder should mandate OA to the research it funds.Noble wishes. But alas not being fulfilled -- and it's been a long wait. That's why specifying institutional locus-of-deposit for existing funder-mandates matters: to help make these wishes come true instead of just continuing to hope they will come true of their own accord.
"[P]ractice has shown that policy is significantly more effective than unenforceable exhortations to authors in general. So our question is not whether to mandate OA, but how"Indeed. But exhortations to mandate are not effective either! So this is an exhortation to make a specific modification in existing (funder) mandates to make them more effective in generating further (institutional) mandates.
"Where he’s wrong: Dr. Harnad repeats...“Not All Research Is Funded, But All Research Is Institutional”... he weakens this statement to 'virtually all'..."Not all researchers have fingers -- only "virtually all" do: That does not mean it cannot be mandated that the deposit keystrokes need to be done.
"[N]ot all academic journal literature is written by authors at colleges, universities, or research institutions"Fine. This changes nothing regarding the potential benefits of funder mandates stipulating IRs as the default locus of deposit. As noted, where the fundee has no IR, they can and should deposit in a back-up CR such as DEPOT.
"Institutional mandates won’t touch these [unaffiliated] authors (by definition). Funder mandates may touch some, though I’d suspect that most such research is unfunded"That makes this case even more irrelevant. This is about funder-mandates inducing fundees' institutions to adopt institution-mandates. No IR? Or no Instiutution? Deposit in the DEPOT -- and let's not let this minuscule number of exceptions obscure the potential benefits of funder mandates stipulating institutional deposit as a rule!
"(Harnad notes that some funders do offer “back-up” repositories for grantees without IRs, but without endorsing this flexibility.)"I did indeed endorse it, many, many times: And, in case of doubt, I endorse it here too.
"Harnad also deftly re-defines the traditional meaning of “institution” in “IR”, to mean “any organization whose employees might ever produce research”... As a result, he reduces further the number of “unaffiliated” researchers"Yes. And your point is...?
("Every institution is just (a) a piece of free software, (b) a $1000 linux server plus (c) a few days of sysad set-up time away from having its own IR. In the meanwhile, there are interim alternatives like the DEPOT...")
"Doesn’t it make more sense to tell your employees to deposit in a higher-visibility thematic repository?"Most definitely not. It makes immeasurably more sense to mandate that both fundees and institutional employees deposit in their own IR (or DEPOT) and let high-visibility CRs harvest the deposits thereafter, because that will generate more institutional mandates. That is in fact the very essence of the point I am making.
"My point about authors with multiple institutional affiliations also goes unheeded. Harnad suggests that this situation “only” requires one deposit, “followed by export to the IRs of the rest of his institutions”. Well, now we’re talking about export, which is exactly what I call for between IRs, thematic, and funder repositories. There’s no difference between exporting from one IR to another and between exporting from a thematic repository to an IR."That's the symmetry-fallacy again, which is exactly what I was refuting: There's no point "calling for" ("exhorting") the export of nonexistent (because undeposited) content. The point here is about funder-mandates designating institutional deposit in order to generate institutional-mandates (waking the "slumbering giant", "more bang for the research buck," or whatever you want to call it) so as to generate all that missing content!.
"Harnad also understates the resource requirements for adequately maintaining an IR. I’ll let Dorothea Salo answer this point."Umm -- I'd rather this point was answered by the managers of the few successful, well-stocked IRs that are actually doing what OA IRs are for -- rather than doing anything but (and at high cost)...
"Where I was wrong: Since more research comes out of institutions (as a whole) than is funded (as a whole), institutional mandates offer broader coverage."No, institutional mandates would offer broader coverage (indeed, universal coverage, if universally adopted). But only 30 have been adopted so far; and so what we are talking about here is how funder-mandates stipulating institutional deposit could induce more institutions to mandate deposit for the rest of their total output, funded and unfunded.
"(But a given funder may sponsor significantly more research than a given institution produces, and there are fewer funders than institutions, so they give more “bang for the buck” as initial targets.)"Again, the point is not that a funder generates more research than a single institution (though most do), but that a funder generates research at many institutions. That's what will give them "more bang for the buck" if they mandate IR rather than CR deposit. (And we are talking about making this small but important change in existing funder-mandates, not about just wishing for more funder-mandates.)
"[D]o funders have any leverage to encourage institutions to deploy IRs and mandate deposit? I am less sanguine about this than Harnad, but flooding IRs with deposits certainly could have the effect of building institutional constituencies for IRs and thus for institutional mandates."Sounds like you are and are not sanguine, Gavin!
And funders do have enormous "leverage" with their fundee institutions. But we are not talking about direct leverage to require fundee institututions to create IRs or adopt deposit mandates -- just the indirect leverage that would result from specifying IRs instead of CRs as the funder-mandate's default locus of deposit.
"Therefore: Is there some promise in using funder mandates to prod universities toward institutional mandates? Yes, there’s some. Is it a sure thing? No. Do funders still need to provide a “repository of last resort” to deal with exceptions? Yes."Agreed! So what are we arguing about?
(Yes, "repositories of last resort" do need to provided -- though it's not clear why they have to be provided by the funder; nor how many of them are really needed. There's plenty of room, for example, in DEPOT, which currently has only 66 deposits after almost two years of existence, simply because funders are mandating direct deposit in their own CRs instead of the fundee's IR -- for which DEPOT was specifically created to serve as the back-up.)
"Is it still easier for funders to monitor compliance and ensure adequate levels of service (including preservation) by requiring grantees to get their work into the funder’s repository, one way or another? Yes."Here's the way: import/export/harvest them from the IRs (and DEPOTs). And the fundee institutions will be more than happy to collaborate in monitoring and ensuring compliance with the funder's grant fulfillment conditions.
"Require grantees to deposit in their IR, if they have one. Require also that grantees ensure their publication ends up in your repository, wherever they initially deposited it. If they are accustomed to depositing in an institutional repository, they’re responsible for working with your repository manager to ensure their work gets harvested. Better still if the repository allows users to set up automatic harvesting themselves, either for a single publication or for all the author’s future publications, if they don’t want to talk to the repository manager."Yup, that's it! So what are we arguing about, Gavin?
"[Or] Encourage grantees to deposit in their IR, if they have one."Now that, in contrast, would be useless, just as encouraging ("exhorting," "calling for," "wishing") authors to deposit -- or funders or institutions to mandate -- is useless. What we are talking about is a tiny but concrete and specific change in the implementational details of actual, adopted funder mandates, so as to require institutional (or, as last resort, DEPOT) deposit, in order to generate more institutional mandates. Merely "encouraging" it will generate nothing.
"[T]he risk of backlash if IRs are poorly managed, a factor over which the funder can have little control..."Why are we speculating about the possibility of full, ill-managed IRs when what we are actually faced with is virtually all IRs (whether well-managed or ill) empty! Necessity is the mother of invention: Fill those IRs with their valuable intended content, and institutions will nurture them as the invaluable assets they will prove to be. Keep fussing instead about the current or future management of idle or nonexistent IRs and you will have years more of idle or nonexistent IRs.
"Funder repositories come with their own cost: the increased potential for government interference in science that comes with greater centralization"Let's not get carried away! We're talking about published journal articles, made openly accessible, free for all. If they are deposited in each researcher's distributed IRs, they can thence be harvested into multiple CRs. If there is government "interference" (whatever that means) in governmental CRs, there will be plenty of other harvested CRs (e.g. google scholar!) where those came from.
American Scientist Open Access Forum
Display comments as (Linear | Threaded)
Syndicate This Blog
Materials You Are Invited To Use To Promote OA Self-Archiving:
The American Scientist Open Access Forum has been chronicling and often directing the course of progress in providing Open Access to Universities' Peer-Reviewed Research Articles since its inception in the US in 1998 by the American Scientist, published by the Sigma Xi Society.
The Forum is largely for policy-makers at universities, research institutions and research funding agencies worldwide who are interested in institutional Open Acess Provision policy. (It is not a general discussion group for serials, pricing or publishing issues: it is specifically focussed on institutional Open Acess policy.)
You can sign on to the Forum here.