Opened 17 years ago
Closed 17 years ago
#35 closed task (fixed)
Port .lgf related tools
Reported by: | Alpar Juttner | Owned by: | Balazs Dezso |
---|---|---|---|
Priority: | blocker | Milestone: | LEMON 1.0 release |
Component: | core | Version: | hg main |
Keywords: | Cc: | ||
Revision id: |
Description
This involves the .lgf
specification and the reader and writer classes (GraphReader
,GraphWriter
, LemonReader
and LemonWriter
)
They also need a thorough revision, the .lgf
specification must be clarified and simplified a lot.
Attachments (12)
Change History (37)
comment:1 Changed 17 years ago by
comment:2 follow-up: 3 Changed 17 years ago by
I agree with Alpar, that the new format should be readable without knowing the input method. But I think format suggested by him is to restrictive.
In my opinion the format should look like that:
- It contains well parsed expressions from (), {}, []
- In each inner space between parses could be any sequence of characters and strings
- The outer spaces should not contain whitespace out of a string.
- The string representation is a possible escaped character sequence surrounding "" or
comment:3 Changed 17 years ago by
Replying to deba:
I agree with Alpar, that the new format should be readable without knowing the input method. But I think format suggested by him is to restrictive.
Could you show some examples where it is really restrictive?
As far as I understand, your specification is more flexible than mine only if both of the followings hold:
- The value is a properly bracketed expression,
- It contains white-spaces, but only inside a bracket.
Even in this case the only thing you can save is a pair of double quotes around the value. Is that really worth the increased complexity? Remember, if we define a more complex rule for the column separation and reconstruction, then all tools working with .lgf files must implement this rule.
comment:4 Changed 17 years ago by
Owner: | changed from Alpar Juttner to Balazs Dezso |
---|
comment:5 Changed 17 years ago by
Peter and Balazs,
Could you summarize here the conclusion of our private discussion on this topic?
comment:6 Changed 17 years ago by
Priority: | major → blocker |
---|
Changed 17 years ago by
Attachment: | lgf_io-single.patch added |
---|
The changesets of lgf_io.hg merged into a single one.
comment:8 Changed 17 years ago by
Changed 17 years ago by
Attachment: | lgf_io.bundle added |
---|
comment:9 Changed 17 years ago by
I have uploaded a new bundle for lgf io. In my opinion it is close to the final version, but it should be tested and documented. On the writer side some runtime checking would be also fine.
comment:10 Changed 17 years ago by
The changesets in attachment:lgf_io.bundle has been merged into a single changeset and it is now in the main branch, see [1c9a9e2f7d4d].
The undirected versions are still unimplemented.
comment:11 Changed 17 years ago by
Version: | → hg main |
---|
Changed 17 years ago by
Attachment: | digraph_reader_doc.patch added |
---|
comment:12 Changed 17 years ago by
I have just now uploaded a patch with the preliminary documentation of the DigraphReader?. Please fix it, improve it, advance it. If this documentation is finalized, the documentation of DigraphWriter? class could be made also.
Changed 17 years ago by
Attachment: | digraph_reader_doc.2.patch added |
---|
comment:13 Changed 17 years ago by
Is there any reason/advantage to use a different column separation rule for the header line of @nodes/@edges sections and for their contents?
If we used the same, then
- the code would be easier
- the file format would be more homogeneous, therefore easier to describe and document,
- it would be flexible (but still compatible)
than what we have now. For example, it would allow space in the column names. Currently, the reader must check if the column name is a single word, and all of the software working with column names given by the user (such as glemon) should also do so.
Changed 17 years ago by
Attachment: | lgf_doc.patch added |
---|
comment:14 follow-up: 15 Changed 17 years ago by
I uploaded a new patch with solution for Alpar issue (but I do not agree with him completely), and with the documentation of DigraphReader? and DigraphWriter?.
comment:15 follow-up: 16 Changed 17 years ago by
Replying to deba:
I moved the description of the lgf format into a separate page, and also made several improvements on this text, as well as on the doc of the DigraphReader and DigraphWriter.
comment:16 Changed 17 years ago by
I merged attachment:digraph_reader_doc.2.patch, attachment:lgf_doc.patch and attachment:lgf-page.patch into a single changeset [e561aa7675de], and put it into the main branch.
Can we close this ticket? Note that we have another related issue (#91).
comment:17 follow-up: 18 Changed 17 years ago by
I mean, the undirected graph IO should be implemented, or a new ticket should opened for this. What do you think about its implementation, could we apply exhaustively the copy-paste, or do you have any other "good" idea?
comment:18 Changed 17 years ago by
Resolution: | → fixed |
---|---|
Status: | new → closed |
Changed 17 years ago by
Attachment: | lgf_bug_fix.patch added |
---|
Changed 17 years ago by
Attachment: | lgf_demo.patch added |
---|
comment:19 follow-up: 20 Changed 17 years ago by
Resolution: | fixed |
---|---|
Status: | closed → reopened |
I have uploaded to patches.
The first mainly solves some bugs:
- Fixing function interface for lgf readers and writers
- The characters under 0x20 should be escaped
The second patch contains a better demo file.
comment:20 Changed 17 years ago by
Resolution: | → fixed |
---|---|
Status: | reopened → closed |
Replying to deba:
I have uploaded to patches.
The first mainly solves some bugs:
- Fixing function interface for lgf readers and writers
- The characters under 0x20 should be escaped
The second patch contains a better demo file.
They went to the main branch, see [c82fd9568d75] and [00d297da491e].
Changed 17 years ago by
Attachment: | 3d9971265677.patch added |
---|
Changed 17 years ago by
Attachment: | b026e9779b28.patch added |
---|
comment:21 follow-up: 22 Changed 17 years ago by
Resolution: | fixed |
---|---|
Status: | closed → reopened |
I have uploaded two changsets:
- [3d9971265677] makes some clarification in the usage of skipSection()
- [b026e9779b28] ports the ContentReader? as LgfContent? class
comment:22 Changed 17 years ago by
Resolution: | → fixed |
---|---|
Status: | reopened → closed |
Replying to deba:
I have uploaded two changsets:
They went to the main branch.
- [3d9971265677] makes some clarification in the usage of skipSection()
Probably you meant [c94a80f38d7f] here.
- [b026e9779b28] ports the ContentReader? as LgfContent? class
Changed 17 years ago by
Attachment: | 9159de5e9657.patch added |
---|
comment:23 follow-up: 24 Changed 17 years ago by
Resolution: | fixed |
---|---|
Status: | closed → reopened |
The [9159de5e9657] solves that the label map is not necessary, and the map names line is necessary, if no map is read and there is no item in the item set.
comment:24 Changed 17 years ago by
Replying to deba:
The [9159de5e9657] solves that the label map is not necessary, and the map names line is necessary, if no map is read and there is no item in the item set.
Accepted.
comment:25 Changed 17 years ago by
Resolution: | → fixed |
---|---|
Status: | reopened → closed |
Now, I start a discussion here, how the
.lgf
format should look like.First of all my basic principle is
Now, let's consider a single block (the inside of a @nodeset or @edgeset). This part of the file consists of columns. Each column is a string (i.e. sequence of characters), independently of how it is interpreted when it is read. My (second) principle here is
My rules for the column separation would be as follows.
\n
,\"
etc). Then the interpretation of the string would be the same as the standard C strings.More complex rules can also be worked out, but I think this simple one is comfortable enough for all purpose.
Once we agreed on this, we can continue with discussing how to handle
@edgesets
,and also how the read/write API should look like.