Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Propose to remove the term metadata from the draft. #66

Open
adammontville opened this issue Dec 10, 2017 · 11 comments
Open

Propose to remove the term metadata from the draft. #66

adammontville opened this issue Dec 10, 2017 · 11 comments

Comments

@adammontville
Copy link
Contributor

At its heart, metadata is simply "data about data", and there really isn't much more to say. There is a bunch of expositional text associated with the term in the draft:

In the SACM information model, data is referred to as Content. Metadata about the content is referred to as Content-Metadata, respectively. Content and Content-Metadata are combined into Subjects called Content-Elements in the SACM information model. Some information elements defined by the SACM information model can be part of the Content or the Content-Metadata. Therefore, if an information element is considered data or data about data depends on which kind of Subject it is associated with. The SACM information model also defines metadata about the data origin via the Subject Statement-Metadata. Typical examples of metadata are time stamps, data origin or data source.

This information isn't really helpful to the definition, and because metadata is a very common term and widely understood as data about data, I propose we remove the term from this draft.

@adammontville adammontville added bug and removed bug labels Dec 10, 2017
@henkbirkholz
Copy link
Member

I would like to rephrase it into a question and a meta-question:

Is the definition helpful to understand the term metadata and its usage of the sub-types wrt implementors or other readers of the document?

I am basically neutral. If not defined here, it has to be elaborated on where it is used most, which is in the information model, I think. I am slightly in favor of keeping both definition and context here in this doc and have a redundant paragraph about when or how a IE type is used to represent metadata and when it represents content, in the IM too.

@strazzie123
Copy link
Collaborator

strazzie123 commented Dec 14, 2017 via email

@henkbirkholz
Copy link
Member

These are probably two points worthwhile to be captured and worked into an improved definition (if we decide not to drop the term):

  • any IE can be used as data or as metadata, depending on context
  • metadata can be descriptive as well as prescriptive

@sacm
Copy link

sacm commented Dec 16, 2017 via email

@henkbirkholz
Copy link
Member

Hm, whike certainly possible, it would create a lot of redundancy (YANG just removed a similar type of redundancy, via NMDA, by collapsing redundant trees). An overused but viable example: the IE IPv4-Address.

It can be part of an TE identifier -> being metadata.

It can be part of SACM Content -> being data.

Would you recommend to introduce two types of IPv4-Address for this? NETCONF, for example, reverted that kind of decision.

@henkbirkholz
Copy link
Member

Addendum: From an Ontology pov (I did not take that into account in my first reply), it is rather okay to tie (domain/range association) a concept via two different object properties to two different other core concepts (e.g. data & metadata) concepts. Even in the more restrictive scope of taxonimc parents(hip), there can be more than one parent concept (although this would render an a-box rather complex and might not be recommended in a production envirionment, as it requires more complex and reliable reasoners - i.e. potentially creates a lot of reasoning overhead).

@strazzie123
Copy link
Collaborator

strazzie123 commented Dec 17, 2017 via email

@strazzie123
Copy link
Collaborator

strazzie123 commented Dec 17, 2017 via email

@henkbirkholz
Copy link
Member

Sorry, I fail to see: 1) why we care if there is redundancy in an information model a) YANG is a
data model b) the purpose of an info model is to define objects and their relationships;
redundancy is a function of good the model is

I agree, I got distracted by Ontologies :-) Touché. This is about the IM.
Redundancy vs. Readability and in consequence comprehensibility.

Nope. As I said earlier, having something being of two types is problematic at best wrt code
generation. Furthermore, why would an identitifer ever be considered metadata? Isn't an identifier at > least as important as data? :-) And remember, metadata is typically considered optional.

I would still argue it is the same type (e.g. in my example) used in different contexts (e.g. via ontological object property relationships), which provides it with more context, mae it data and metadata in the first place - correspondingly until now I did not understood metadata as specific subset of all types.... like unit32 and unit32-metadata.

You made me become more neutral on this point, with a slight favor of not creating redundant IE types for metadata and data, but I am starting to understand your point better.

@strazzie123
Copy link
Collaborator

strazzie123 commented Dec 18, 2017 via email

@adammontville
Copy link
Contributor Author

Thankfully (?), we are talking about a terminology draft, which is more aligned with an ontological perspective than one tied to a specific information or data model. Back to the original question: Do we, or do we not, keep the term metadata? It is already broken out into definition followed by exposition, so we could simply leave it, but as John pointed out we might want to clean up the expositional text to something more effective than:

In the SACM information model, data is referred to as Content. Metadata about the content is referred to as Content-Metadata, respectively. Content and Content-Metadata are combined into Subjects called Content-Elements in the SACM information model. Some information elements defined by the SACM information model can be part of the Content or the Content-Metadata. Therefore, if an information element is considered data or data about data depends on which kind of Subject it is associated with. The SACM information model also defines metadata about the data origin via the Subject Statement-Metadata. Typical examples of metadata are time stamps, data origin or data source.

John or @henkbirkholz if you have proposed text for the expositional replacement, please provide it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants