Skip to content

Rhaptos Software Development

Personal tools
You are here: Home » Developer Blog » Brian's Sooth » LaTeX Importer - Bibliographies

LaTeX Importer - Bibliographies LaTeX Importer - Bibliographies

Document Actions
Submitted by bnwest. on 2007-11-02 11:29. Development
Shows how the forms of LaTeX bibs get translated by tralics and then by us into CNXML.

We have two forms of bibs in LaTeX.  Each get different handling during the conversion into CNXML.


1. \begin{thebibliography}...\end{thebibliography} with bib cites as "[n]"

\begin{thebibliography}

\bibitem{CandesRUP}

E.~Cand\`{e}s, J.~Romberg, and T.~Tao, ``Robust uncertainty principles: {E}xact

  signal reconstruction from highly incomplete frequency information,''

  \emph{IEEE Trans. Inform. Theory}, vol.~52, no.~2, pp. 489--509, Feb. 2006.

...

\bibitem{Nowak}

J.~Haupt and R.~Nowak, ``Signal reconstruction from noisy random projections,''

  \emph{IEEE Trans. Inform. Theory}, vol.~52, no.~9, pp. 4036--4048, Sept.

  2006.

\end{thebibliography}

Note that the LaTeX is free formed, i.e. the bib components are not tagged with their semantic entity like author, publication, year, etc.

tralics translates that into

<Bibliography>

<p>bib entry</pc>

...

<p>bib entry</pc>

</Bibliography>

We translate the above into the following CNXML:

      <list id="id2256574" type="enumerated">
        <name>Bibliography</name>
        <item id="bid11">
D. Achlioptas, Database-friendly random projections, Proc.
ACM SIGACT-SIGMOD-SIGART Symp. on Principles of Database Systems
(2001), pp. 274–281.</item>
      ...

        <item id="bid13">S. Mendelson, A. Pajor, and N. Tomczack-Jaegermann,Reconstruction and subgaussian operators in AymptoticGeometric Analysis, (2006), Preprint.</item>

      </list>

We also map the bib ids to their cites in the document body.

2. \bibliography{p2bib} with bib cites as "[author,year]"

where p2bib.bib is a stand alone file contains bib entries like

@inproceedings{acimovic05,

    author={J. A\'{c}imovi\'{c} and R. Cristescu and B. Beferull-Lozano},

    title={Efficient Distributed Multiresolution Processing for Data Gathering in Sensor Networks},

    booktitle={Proc. {IEEE} Int. Conf. on Acoustic and Speech Sig. Proc. (ICASSP)},

    year={2005},

    month={Mar.},

    pages={IV-837---IV-840}

}

@book{selin-95,

   author = {Ivan Selin},

   title = {Detection Theory},

   publisher = {Princeton University Press},

   address = "Santa Monica, CA",

   year = "1965",

}

Note that the bib components are semantically tagged.

tralics trnslates this into

<biblio>
<citation from='year' key='ACBL05' id='bid7' userid='cite:acimovic05' type='inproceedings'>
<bauteurs><bpers prenom='J.' nom='A&#x107;imovi&#x107;' prenomcomplet='J.'/><bpers prenom='R.' nom='Cristescu' prenomcomplet='R.'/><bpers prenom='B.' nom='Beferull-Lozano' prenomcomplet='B.'/></bauteurs>
<btitle>Efficient Distributed Multiresolution Processing for Data Gathering in Sensor Networks</btitle>
<bbooktitle>Proc. IEEE Int. Conf. on Acoustic and Speech Sig. Proc. (ICASSP)</bbooktitle>
<bpages>IV-837&#x2014;IV-840</bpages>
<bmonth>Mar.</bmonth>
<byear>2005</byear>
</citation>

...

</biblio>

tralics keeps the semantics. We translate this into the following CNXML:

      <list id="id2254928" type="enumerated">
        <name>Bibliography</name>
        <item id="bid7">
J. Aćimović, R. Cristescu, B. Beferull-Lozano,
Efficient Distributed Multiresolution Processing for Data Gathering in Sensor Networks.
Proc. IEEE Int. Conf. on Acoustic and Speech Sig. Proc. (ICASSP).
pp. IV-837—IV-840.
Mar.
2005.
</item>

...

      </list>

We lose the semantics for CNXML.  I am unsure if thetralics XML for this bib is bibtexml or not.  If it is, we can trivially translate the bibtxml into CNXML, which supports bibtxml.  If it is not, we still should be able to translate into bibtexml since all the semantics are present.









Developer Blog
« July 2008 »
Su Mo Tu We Th Fr Sa
    1 2 3 4 5
6 7 8 9 10 11 12
13 14 15 16 17 18 19
20 21 22 23 24 25 26
27 28 29 30 31    
2008-07-02
12:53-12:53 Performance tests of module PDF generation
Categories:
Content (55)
Copyright (0)
Deep Code (3)
Development (198)
Markup (22)
Metadata (1)
Printing (7)
Style (9)
Testing (2)
Usability (6)