Page 37 -
P. 37

4  Suggestions for Further Research

                   The data herein rely heavily upon a keyword identification of clean energy patents that
                   focus on late-stage innovations.  Future work needs to update and make available
                   identification and categorization using a more general and flexible ontology comprised of
                   a mixture of keyword, patent classification code, and assignee-based identification that is
                   simultaneously inclusive of earlier-stage technologies but still accurate.


                   Significant work will need to be undertaken to further validate the Web-presence
                   methodology.  It would be extremely difficult to do this for the complete patent dataset
                   due to the labor intensity of determining whether each patent has demonstrated
                   commercial value.  However, future work could include other institutions’ licensed
                   versus unlicensed patents as a method of calibrating this method.  Qualitative surveys
                   with active inventors will enrich this line of analysis.  Lines of inquiry may include the
                   inventor’s qualitative assessment of the patent’s value, the method of commercialization
                   (start-up, license to large firm, license to non-profit, not deployed and why), and the level
                   of commercial success.  Additional focus on refining data quality around regional,
                   organizational, collaborative, and institutional factors, as well as the funding sources for
                   the research (particularly if the research was supported by a government grant) would be
                   helpful.  Also, further effort should be expended in the automated analysis of the content
                   of each type of website.  A variety of schemes could be investigated [87], including
                   simple search term counts, dictionary look-up of appropriate terms, and semi-supervised
                   machine learning algorithms (at least for the patents with enough hits apply such methods
                   [88]).


                   We hope that this work will contribute to an open platform for future research by the
                   community of science and policy scholars.  The focus in this proposal on clean energy is
                   just one slice of this effort.  We hope to build, through the collaborative work of many
                   scholars, a series of integrated databases that would enable the tracing of scientific
                   investment from grants all the way to commercialization.  Figure 11 illustrates an
                   idealized schematic of this goal.  This effort would help accelerate the development of a
                   system where most grants could be traced to papers, papers to papers through citations,
                   papers to patents through people and citations, and patents to commercial outcomes
                   through people (for example, on boards of directors), licensing agreements, and the
                   proposed Web-based measure of commercial impact and deployment.  This would be a
                   powerful tool for both DOE and the research community to better assess the value and
                   success of research and development investments. This capability will be essential as the
                   market quickly evolves, research investments increasingly come from the private sector,
                   and there is a call for increased transparency on public research expenditures.  Further
                   development of this database and the addition of new datasets outside our control through
                   an open-research model will enable this tool to be broadly used to investigate innovation,
                   commercialization, and deployment for DOE and the public sector in general.












                                                             30
   32   33   34   35   36   37   38   39   40   41   42