/ Zope / WHI Hosted Mailing Lists / osis-user / Archive / 2004 / 2004-06 / OSIS 2.0 User Doc

[ << ] [ >> ]

[ pollo.sourceforge.net - finally an editor with ... ] [ RE: Erado UCEI: 4.1 RE: [osis-user] OSIS 2.0 User ... ]

OSIS 2.0 User Doc
"Thomas BOEHME" <catb(at)gmx.net>
2004-06-07 08:26:50 [ FULL ]
Maybe some of my confusion stems from the fact that I only have a 1.5
version of the User Guide, some parts of which are still to be fleshed out,
whereas the OSIS docs on the web site are mostly 1.1.1, while you think in
2.0 terms and have the advantage of having been at the meetings. So, the
most obvious question would then be: Is there (a draft of) a OSIS 2.0 User
Guide available somewhere?

One thing that I want to get my head around now is the whole issue of
continuing elements. The way I read the User Guide (1.5) and the header of
the osisCore.2.0.1.xsd.xml, these exist because paragraphs and quotations
and the like, may cross chapter and verse boundaries. But really I would
like to make a strong case for leaving chapters and verses atomic (ie. with
proper start and end tags and the content in between). After all, these are
the primary ordering criteria (including their encompassing divisions). XML
should in the first place be about content, and only in second order deal
with presentation (which I realize is important too for the overall purposes
of OSIS). This way, "chapter" can also remain a type of division as before
(or were there other reasons for making it a separate type?). I think making
both organizational elements (chapter, verse, div) *and* presentational
elements (q, lg, etc) milestoneable is rather odd. Or is it just me you
can't follow the argument that book/chapter/verse is always derviable from
div/p? Why isn't it possible to make p milestoneable instead of
div/chapter/verse?

Can someone shed some light on this for us/me uninitiated?

Thanks
Thomas Boehme

Re: [osis-user] OSIS 2.0 User Doc
Jim_Albright(at)wycliffe.org
2004-06-07 09:43:24 [ FULL ]
There are several views of scripture. Each is equally valid.
Perspective: "The XSEM project has chosen to use the perspective that 
comes from the task of publishing Scripture as the primary perspective in 
the encoding model." Other perspectives are modeled as possible, although 
not always in the principal hierarchy: "The Canonical view [books contain 
chapters that contain verses]; The Outline view [sections contain sections 
that contain paragraphs]; The Literary view [a collection of prose and 
poetry -- here are various types of prose i.e., narrative, letters, speeches];
The Speech/Discourse view [a series of 
nested speech segments, that is, direct and indirect quotes]; The 
Linguistic view [sentences contain phrases that are made of parts of 
speech, such as nouns, verbs and adjectives]; The Source view [a series of 
witnesses and renderings with one chosen as primary]."
As an experiment my son Eric created a milestone view for each element and 
was able to transform from one primary view to another though the 
milestoned view.  Since publication is my principle need, the publishing 
view is best for me.


Jim Albright
704 843-0582
Wycliffe Bible Translators






"Thomas BOEHME" <catb(at)gmx.net>
06/07/2004 08:26 AM
Please respond to osis-user

 
        To:     <osis-user(at)whi.wts.edu>
        cc: 
        Subject:        [osis-user] OSIS 2.0 User Doc


Maybe some of my confusion stems from the fact that I only have a 1.5
version of the User Guide, some parts of which are still to be fleshed 
out,
whereas the OSIS docs on the web site are mostly 1.1.1, while you think in
2.0 terms and have the advantage of having been at the meetings. So, the
most obvious question would then be: Is there (a draft of) a OSIS 2.0 User
Guide available somewhere?

One thing that I want to get my head around now is the whole issue of
continuing elements. The way I read the User Guide (1.5) and the header of
the osisCore.2.0.1.xsd.xml, these exist because paragraphs and quotations
and the like, may cross chapter and verse boundaries. But really I would
like to make a strong case for leaving chapters and verses atomic (ie. 
with
proper start and end tags and the content in between). After all, these 
are
the primary ordering criteria (including their encompassing divisions). 
XML
should in the first place be about content, and only in second order deal
with presentation (which I realize is important too for the overall 
purposes
of OSIS). This way, "chapter" can also remain a type of division as before
(or were there other reasons for making it a separate type?). I think 
making
both organizational elements (chapter, verse, div) *and* presentational
elements (q, lg, etc) milestoneable is rather odd. Or is it just me you
can't follow the argument that book/chapter/verse is always derviable from
div/p? Why isn't it possible to make p milestoneable instead of
div/chapter/verse?

Can someone shed some light on this for us/me uninitiated?

Thanks
Thomas Boehme


[...]

Re: [osis-user] OSIS 2.0 User Doc
"Thomas BOEHME" <catb(at)gmx.net>
2004-06-07 10:36:20 [ FULL ]
I think I see the point you are trying to make, but don't you agree that
allowing milestones on the organizational/canonical level as well as the
content level is a bit of a mishmash? Exactly for the same reason that all
models can be transformed into each other (thanks to milestones) I would
rather make one view the primary one and go with it. Maybe all I am trying
to say is: let's make the canonical one the primary one. This would leave
developers and users less room for interpretation which I think is a good
idea.

Cheers,
Thomas

----- Original Message ----- 
From: <Jim_Albright(at)wycliffe.org>
To: <osis-user(at)whi.wts.edu>
Sent: 07 June 2004 15:42
Subject: Re: [osis-user] OSIS 2.0 User Doc

[...]
speeches]; The Speech/Discourse view [a series of[...]
osis-user(at)whi.wts.edu.[...]

Re: [osis-user] OSIS 2.0 User Doc
Jim_Albright(at)wycliffe.org
2004-06-07 12:19:52 [ FULL ]
There can only be one view in focus at a time. If you need to be able to 
have others in focus then possibly OSIS should make 4 or 5 OSIS models so 
everyone can be happy. Transforming from one view to another then would be 
trivial. So why do you favor canonical view. From a translation point of 
view (and possibly from interpretation view) narrowing the viewers focus 
to just a verse may miss the proper meaning of a section.

Jim Albright
704 843-0582
Wycliffe Bible Translators






"Thomas BOEHME" <catb(at)gmx.net>
06/07/2004 10:35 AM
Please respond to osis-user

 
        To:     <osis-user(at)whi.wts.edu>
        cc: 
        Subject:        Re: [osis-user] OSIS 2.0 User Doc


I think I see the point you are trying to make, but don't you agree that
allowing milestones on the organizational/canonical level as well as the
content level is a bit of a mishmash? Exactly for the same reason that all
models can be transformed into each other (thanks to milestones) I would
rather make one view the primary one and go with it. Maybe all I am trying
to say is: let's make the canonical one the primary one. This would leave
developers and users less room for interpretation which I think is a good
idea.

Cheers,
Thomas

----- Original Message ----- 
From: <Jim_Albright(at)wycliffe.org>
To: <osis-user(at)whi.wts.edu>
Sent: 07 June 2004 15:42
Subject: Re: [osis-user] OSIS 2.0 User Doc

[...]
in[...]
although[...]
contain[...]
sections[...]
speeches]; The Speech/Discourse view [a series of[...]
of[...]
and[...]
in[...]
User[...]
of[...]
quotations[...]
deal[...]
before[...]
from[...]
osis-user(at)whi.wts.edu.[...]

[...]

Re: [osis-user] OSIS 2.0 User Doc
Chris Little <chrislit(at)crosswire.org>
2004-06-08 03:21:37 [ FULL ]
On Mon, 7 Jun 2004, Thomas BOEHME wrote:
[...]

There is a set hierarchy model for all OSIS Bibles.  That hierarchy model 
is book (<div type="book">), section (<div type="section">),
paragraph 
(<p>).  None of these elements are milestonable.

I don't think anyone particularly lamented not using book (<div 
type="book">), chapter (<chapter>), verse (<verse>) as the
primary model 
since this is a fairly hideously flawed model.  I think one of our guiding 
principles was the old maxim that easy/common stuff should be easy and 
hard stuff should be possible.  Bibles in which paragraphs cross 
verse/chapter boundaries and verses cross chapter boundaries would be 
impossible to encode if we had gone with un-milestonable <chapter> and 
<verse> elements.

We respect your desire to keep things easy, and that is why you can still
encode Bibles using only containers for book, section, paragraph, chapter,
and verse.  But we could not limit functionality so that more complex 
Bibles (like most real, modern, print Bibles are) could not be encoded 
correctly.

--Chris

Re: [osis-user] OSIS 2.0 User Doc
"Thomas BOEHME" <catb(at)gmx.net>
2004-06-08 11:07:56 [ FULL ]
Chris,

Sounds like there is dark veil across my eyes... What exactly is the
difference between (equally applicable to chapters):

<p>
<v>....</v>
<v sID="123"/>...
</p>
<p>
....
<v eID="123"/>
<v>...</v>
<p>

and this one:

<p sID="abc"/>
<v>...</v>
<v>...<p eID="abc"/><p sID="xyz"/>...</v>
<v>...</v>
<p sID="xyz"/>

Something similar actually is in some of the OSIS documents on the website
coded as <milestone type="x-p"/>.

For your info: I have the programmer (= user) perspective on this, not the
creator perspective - if that makes a difference to your explanation. I will
never create a bible, so the argument that I *could* encode my bible this
way if I wanted to doesn't matter to me. I would to deal with *your* (?)
bibles in the best way possible.

The reason why I favor a canonical approach is that for online bibles the
two most common ways of access is through a) scripture reference or through
b) word search. For the latter, clearly the model doesn't matter, but for
the former atomic verses are rather advantageous. For example, in a
parallel/synoptic view, paragraphs are likely to be different for each
translation, but verses (flawed as they are) are what unites them and keeps
the views in sync.

What you describe (correct me if I am wrong) is the generation of printable
output from the model, which is pretty straight forward either way as you
can just generate the output from the model as you stream it (eg. using a
SAX parser), but jumping into the model from an arbitrary point is what I am
concerned about (yes, pretty single-minded on my behalf). Some newer SAX
parsers in Java allow for deferred instantiation of the model and
milestonable verses and chapters ruin this scheme for the most part. So, in
my view, for bible "printers" either way should be OK and renders the same
result, for online access, the canonical version is advantageous which is
why I suggest to make that the primary model.

Cheers,
Thomas


----- Original Message ----- 
From: "Chris Little" <chrislit(at)crosswire.org>
To: <osis-user(at)whi.wts.edu>
Sent: 08 June 2004 09:27
Subject: Re: [osis-user] OSIS 2.0 User Doc

[...]
all[...]
trying[...]
leave[...]
good[...]
osis-user(at)whi.wts.edu.[...]

Re: [osis-user] OSIS 2.0 User Doc
Chris Little <chrislit(at)crosswire.org>
2004-06-09 05:05:16 [ FULL ]
On Tue, 8 Jun 2004, Thomas BOEHME wrote:
[...]
*snip*

In theory, there's not much difference between assigning priority to 
structural description (like <p>) and the BCV hierarchy.  We considered 
both options extensively and chose the structural option partly because it 
was in use by a large set of our users already, partly because it is based 
on actual content in light of centuries of scholarship rather than 
traditions with demonstrable errors, and partly because everyone agreed 
that BCV can easily be extracted from the structural system, but not 
vice versa.
[...]

This is quite simply bad markup.  Any time an "x-" value is used, that 
means it is not part of the standard.  In this case specifically, we have 
an example of VERY BAD markup.  This is exactly the opposite of how 
paragraphs are supposed to be marked up, and was done in spite of the 
document's encoder having been told that this is bad encoding.  Any 
documents abuse the standard in this manner are unlikely to be useful to 
anyone aside from their own encoders and should certainly never be 
released publicly.
[...]

Why does the content model matter?  You can still extract portions of a 
document by BCV reference, using the osisID attribute on book, chapter, & 
verse containers.
[...]

I use OSIS from an encoder's and a programmer's perspective.  I'm not
involved in print publishing at all, though certainly others are.
Book/section/paragraph works well for me precisely because I can still
always grab a section of a document using references on the osisID if I 
need to, but the content model sees Bibles as documents rather than just 
databases with verses for records.

Not to be discouraging, and not that I have any more power than to make 
suggestions, but I don't think there's any chance that we will change our 
basic content model.  The standard is already fairly mature, and there are 
already many applications and encoded documents--all of which would become 
invalid if such a huge change were made.

--Chris

RE: [osis-user] OSIS 2.0 User Doc
"Todd Tillinghast" <todd(at)contentframeworks.com>
2004-06-09 12:23:40 [ FULL ]
To add to what Chris said.

First, the main users/proponents of the BCV hierarchy are on the
software side where they are trying to extract only a range of text
based on a scripture reference.  When faced with the reality that if we
allowed a BCV hierarchy they would ALSO have to support the literary
hierarchy the option becomes clear that we are all better off with fewer
options.


I encoded a few texts both ways and in general the BCV hierarchy is much
simpler to encode as milestones because there are much more complex
structures present in the literary hierarchy.  For example <div
type="book"><div
type="section><list><item><p>text<q
type="block">....
but even the more common case of <div type="book"><div
type="section"><p> has a number of children that can cross the verse
and
chapter boundaries.  So faced with a decision the standards body opted
for the simpler hierarchy to take on the milestone form.

Todd
[...]
considered[...]
because it[...]
based[...]
agreed[...]
have[...]
to[...]
bibles[...]
but[...]
each[...]
and[...]
a[...]
chapter, &[...]
as[...]
using[...]
what[...]
SAX[...]
So,[...]
the[...]
which[...]
I[...]
just[...]
make[...]
our[...]
are[...]
become[...]

RE: [osis-user] OSIS 2.0 User Doc
"Thomas BOEHME" <catb(at)gmx.net>
2004-06-09 14:31:55 [ FULL ]
Even without knowing any of the history, I do realize that <milestone
type="x-p"/> is bad encoding. That is why I said "like"... 

Well, maybe this all comes back to my earlier request for some more complete
sample documents. Maybe I have to see one of those properly encoded bibles
to realize why the content matters more. Having twenty bibles or so myself,
I still fail to understand why the content is less problematic than BCV with
all its "demonstrable errors". I don't think any two of my bibles have the
same headings, render the same things as quotes etc. But I think I will
leave it at that since I don't want to bog down this group with my seemingly
naïve questions.

Let me just make one final remark: doesn't it strike you as odd that with
every example I cited from public OSIS documents (including those from the
official OSIS website) you are telling that it is using bad encoding? In my
view (I deal a lot with XML in my everyday job) that says a lot about the
document *and* the markup.

I hope you don't get me wrong: I appreciate the effort. All I am trying to
add are 2cts programmer (and purist) wisdom. I really would like to help
make OSIS a success.

Cheers,
Thomas

-----Original Message-----
From: Chris Little [mailto:chrislit(at)crosswire.org] 
Sent: 09. June 2004 11:11
To: osis-user(at)whi.wts.edu
Subject: Re: [osis-user] OSIS 2.0 User Doc

On Tue, 8 Jun 2004, Thomas BOEHME wrote:
[...]
*snip*

In theory, there's not much difference between assigning priority to
structural description (like <p>) and the BCV hierarchy.  We considered
both
options extensively and chose the structural option partly because it was in
use by a large set of our users already, partly because it is based on
actual content in light of centuries of scholarship rather than traditions
with demonstrable errors, and partly because everyone agreed that BCV can
easily be extracted from the structural system, but not vice versa.
[...]

This is quite simply bad markup.  Any time an "x-" value is used, that means
it is not part of the standard.  In this case specifically, we have an
example of VERY BAD markup.  This is exactly the opposite of how paragraphs
are supposed to be marked up, and was done in spite of the document's
encoder having been told that this is bad encoding.  Any documents abuse the
standard in this manner are unlikely to be useful to anyone aside from their
own encoders and should certainly never be released publicly.
[...]

Why does the content model matter?  You can still extract portions of a
document by BCV reference, using the osisID attribute on book, chapter, &
verse containers.
[...]
suggest to make that the primary model.

I use OSIS from an encoder's and a programmer's perspective.  I'm not
involved in print publishing at all, though certainly others are.
Book/section/paragraph works well for me precisely because I can still
always grab a section of a document using references on the osisID if I need
to, but the content model sees Bibles as documents rather than just
databases with verses for records.

Not to be discouraging, and not that I have any more power than to make
suggestions, but I don't think there's any chance that we will change our
basic content model.  The standard is already fairly mature, and there are
already many applications and encoded documents--all of which would become
invalid if such a huge change were made.

--Chris




--
To unsubscribe send an email with subject 'unsubscribe' to
osis-user(at)whi.wts.edu.
Please contact klowery(at)whi.wts.edu for questions.
http://127.0.0.1:9673/mailingLists/osis-user/archive/2004/2004-06/1086611210
250/1086771916276

MailBoxer