Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

In theory: Can MPC parse C? #135

Open
IngwiePhoenix opened this issue Apr 20, 2021 · 2 comments
Open

In theory: Can MPC parse C? #135

IngwiePhoenix opened this issue Apr 20, 2021 · 2 comments

Comments

@IngwiePhoenix
Copy link

I am currently trying to look for a way to strip function definitions and the likes from C source files to use in the V language as a basis for writing wrappers.

Short summary:

struct my_struct_s {int a;};
void something(struct my_struct_s* s);

becomes:

#include "my_struct.h"
struct C.my_struct_s {
  a int
}
fn C.something(C.my_struct_s)

This works with enums, unions and the likes the same way. But to get there I ned to extract related information from header files.

Is it possible to use mpc to do this? And if so, how?

@orangeduck
Copy link
Owner

Short answer: no

Long answer: yes

Basically the complete and full parsing of C is difficult because typedefs can mean you don't know if a symbol is a type or an identifier and this changes how you parse things dynamically. It means you will probably struggle to parse it using a grammar only approach since when parsing C you need to evaluate typedefs as you encounter them.

However since mpc allows you to have callbacks or functions you apply associated with particular parsing rules when you build your parser via the functional approach it probably is possible to hack something together which allows for the parsing of C.

Some subset of C may well be fairly straight forward but ultimately I think if you want to parse the complete language of C you are in for a difficult time if you want to do it via mpc.

@IngwiePhoenix
Copy link
Author

I am sorry I only get to see this answer now... oops.

So if I understand correctly: Parsing the raw syntax should be possible, at least to generate an AST. But actually extracting full information would mean I would have to evaluate typedefs (as in, creating a typemap as I go and resolving defined types off that as they are encountered)? That would be perfectly fine, as V compiles to C there is no real need to fully resolve types - just to get the plain definition of other symbols (i.e. translate libraryType funcName(); to the equivalent notation in V, which would be fn C.funcName() C.libraryType).

Again, sorry for the super late reply!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants