-
Notifications
You must be signed in to change notification settings - Fork 12.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Proposal: embedded syntax block #3022
Comments
|
It's meant to transforms one string to an another. So a perfect use case is embedding HTML templates or creating your own CSS language that can live inline with your TS code. But instead of writing it in strings( It isn't meant to be a "full" programming language to "full" programming language embed. So it isn't meant that you can embed PHP code. There is no benefit of embedding a full programming language than to write TS code alone. JSX described itself as "XML-like syntax extension to EcmaScript" — and that is what this embed functionality is meant for extending the TS language with additional syntaxes.
The proposed embedded syntax block is only functioning as sugar. If the embed already contains TS code that are marked(in JSX it is marked inside
yes.
I missed this in my proposal. If balancing works then yes. |
My point is that it does. To take your example: syntax 'jsx' { <div>Hello {this.props.name}</div> } I assume that this will be passed as the string Now consider if this TS expression contains braces, double quotes, escapes, etc. Now transpile() has to implement some of the TS parser to be able to make sense of this embedded DSL. In fact, there can be more than one such syntax 'jsx' { <div>Hello { `this.props.name${ "I'm a template string expression which has its own braces
and contains newlines. I could even contain entire JS statements inside IIFEs." } { This is an unterminated open brace` } }</div> } transpile() will have a hard time parsing this and converting it into real TS without being able to invoke the TS parser on it. What you're essentially describing is a very unrestricted pluggable parser-emitter, which I'm not sure is doable with a grammar such as TS. Compare this to, say, sweet.js, where the unit of transformation for the macro engine is pre-parsed tokens that vaguely resemble JS, not arbitrary text. |
It was a rough proposal 😄 . But one can probably make the TS evaluated only inside Instead of having balanced braces for calculating the end of the embed. All characters are feed one-by-one to a scanner/parser of the transpiler including the open brace |
Perhaps better registered as |
Yep, exactly my point. That is, your transpile() isn't a function from source string to TS. Rather, the TS parser needs to first split the text inside the syntax block into strings and expressions, and transpile() would get such a sequence of strings and expressions that it can then compose into the final TS codegen. (And that should sound a lot like tagged template strings, because it is. Although template string tag functions deal with the results of embedded expressions, not the expressions themselves; but it's the same in that the parser needs to be aware of how to parse them and the tagger needs to work on the separate components. Of course, the value here is to retain the existing DSL rather than have to convert it to template strings.) |
Just for clarifications, there is no need for splits? I meant So an end can be calculated even if we have(no need to involve the TS parser): syntax 'jsx' { <div>Hello {{`hello world :{ ${message`}}</div> } The above example will generate this invalid TS code(from valid JSX code): React.createElement('div', null, 'Hello' , `hello world :{ ${message`); |
No matter how complicated you make your delimiter, it'll always be possible to use it in a way where it doesn't function as a delimiter. syntax 'jsx' { <div>Hello {{ "hello world }} {{" }}</div> } consists of a string "Hello" and a single expression - the string "hello world }} {{". A naive algorithm that doesn't understand TS syntax and only looks for the delimiter would instead infer two expressions. |
As mentioned earlier, you would need to escape syntax 'jsx' { <div>Hello {{ "hello world \}\} \{\{" }}</div> } The transpiler will then unescape those: React.createElement('div', null, 'Hello' , "hello world }} {{"); |
True, I leave it up the TS team to decide if they want to go along this road. Then I guess, it also need to have a command line option. |
Side note: if possible perhaps having the shorter syntax 'jsx' { <div>Hello {{this.props.name}}</div> } |
I think a solution that didn't add any extra syntax to the language, except for the DSL that is embedded, would be much cleaner (something like Babel's transformers). |
@icetraxx isn't For the Babel transforms: This solutions doesn't create an additional loop and doesn't rely on TS team to support any new AST nodes. |
Definitely plus one this. 👍
I agree. TypeScript should look at solving this problem in the abstract rather than implement a solution that favours one specific framework. There are lots of frameworks out there that would benefit from this approach, e.g. // myModel.ts
var viewModel = {
firstName: ko.observable('Joe'),
lastName: ko.observable('Bloggs')
};
ko.applyBindings(viewModel); // myView.ts
'knockout' {
<p>First name: <input data-bind="value: firstName" /></p>
<p>Last name: <input data-bind="value: lastName" /></p>
} I would want to see the embedded syntax type-checked and produce HTML as output. |
@jbondc I believe any kind of DSL that relies on a general purpose language e.g. HTML, CSS, SQL etc are better written as a syntax embed then on a separate file or a multi-line string. Here is an example with Facebook's GraphQL(don't know the whole spec so bare with me): getUser() {
return 'gql' {
@user(1000) {
name,
age
}
}
} Just looking at the their React conference. GraphQL will be based on tagged template strings. It could be much better be based on a syntax embed instead. |
Any feedback from TS team? @ahejlsberg @CyrusNajmabadi @mhegazy etc. |
In thinking about this, we already have a precedent in TS for how domain specific languages (DSLs) like the above are handled: we use the file extension. We support .ts for normal source and .d.ts for declare files. I get there is some ergonomic disadvantage to not being able to embed syntax blocks, but it also comes with some complications. Instead, one thought may be to support addition ..ts. Like .jsx.ts for example. That way it's on a file-by-file with natural delineation. |
One problem with having the file extension is that only one type of DSL can be accepted per file. Just looking at With this solution you don't need to write a compiler from scratch if you want to invent your own language extension. Just write your extension and not your language. Source: |
@tinganho You could potentially compose multiple DSLs with file extensions, i.e. |
@icetraxx true, but then you have the problem that some let say a React components have Having file extensions also implies no embedded solution. So you still have the problem with: how do you implement a language extension without writing a compiler from scratch? Only very big companies like Facebook can afford to do that. How do you deal with multiple language extension embedded in each other? One language extension have a defined behavior with it's parent language, not another language extensions. With embeds you won't have the problem because you can only have one per embed and they are enclosed in |
@jbondc that is embedding |
JSON is another possible candidate that will benefit from this feature: interface User {
firstName: string;
lastName: string;
age: number;
}
'json' {
User: {
"firstName": "Joe",
"lastName" : "Bloggs",
"age": "54" // Error: number expected
}
} Expected output {
"firstName": "Joe",
"lastName" : "Bloggs",
"age": "54"
} Related issue #2064 |
@NoelAbrahams that would be definitely helpful! |
An alternative that does not require a new syntax with its own ways of embedding expressions, escapes, etc: jsx`<div>Hello ${ this.props.name }</div>`;
gql`
@user(1000) {
name,
age
}
`
@compile
function jsx(strings: string[], expressions: string[]): string {
...
} Syntax looks identical to template strings. The parser and the user do not need to learn a new syntax, such as syntax for escaping, embedding TS expressions, balancing braces/backticks, etc. The only difference from actual tagged template strings is that the The presence of the Note that both the original suggestion as well as with this one don't give syntax highlighting, completion and the other LS goodies for the DSL, even for syntax that's obviously JS/TS like in @NoelAbrahams 's comment. They only give this for the embedded TS expressions, if any ( Also @NoelAbrahams, note that the original proposal (and mine) do not handle type-dependent transformations (I asked this in my first comment). The transformer only has access to the string, not the types of the expressions it encounters or have been encountered before, so it does not have access to the interface definition to be able to transform "54" to 54, or complain about "54" vs 54. (This can be handled by having the transformer take a ts.TypeChecker parameter, but then it has to run in a separate step after the parser and checker have already run once. This might be a cleaner approach in general at the cost of increasing compile time.) |
There is a problem with having tagged templates. It becomes apparent when you want to nest things. http://facebook.github.io/jsx/#why-not-template-literals. You would need to type I'm not sure if decorators should be used as meta data for compile time, they are right now just there for providing runtime meta data. And the jsx function could be huge for supporting jsx and other DSL:s. Syntax highlighting can be supported by the IDE developers. I think code completion and other related tooling can be doable by just providing another hook for it. |
The concerns on that page are for JSX via template strings at runtime, not at compile time. For example, it says And no, you would not need to nest jsx'' inside jsx''.
As I said, the decorator is unnecessary since the transformer needs to be explicitly known to the compiler anyway.
It is doable, and as I said, it will require running the transformer after running the type checker, which itself runs after the parser. Thus this will require a separate transformer step instead of being in line with the parser step. |
+1 I thought some people might consider this as something new (or even weird), but it's actually not. Remember the asm block? __asm {
mov al, 2
mov dx, 0xD007
out dx, al
} Yeah. As stated before, what we are going to put in these blocks will probably care about more stuff like types. So, why not pass data between target transpiler? Let's say that changes in TS2 broke some of our code and we want to use that code until we find the time to port it: /// <transpiler name="ts1" path="path/to/ts1" />
var numA: number;
var add = function add(a: number, b: number): number {
return a + b;
};
__ts1 {
import foo = require('../foo.ts');
numA = foo.getNumA();
var numB = foo.getNumB();
}
var total = add(numA, numB); TS can send information about the current scope (like the vars and funcs in it and their types) to the transpiler and if the transpiler cares, it can use that information and send the new information back after it's done. For transpilers which do not support this, we could use ...
__purejs {
var numB = 13;
}
declare var numB: number;
var total = add(numA, numB); // No errors because we let TS know that there would be a numB with type 'number' And to compile my thoughs on how the syntax should be: // These can be both here and in `tsconfig.json`:
/// <transpiler name="jsx" path="path/to/jsx" />
/// <transpiler name="babel" path="path/to/babel" />
// Using the same transpiler with different configuration:
/// <transpiler name="babel2" path="path/to/babel" configuration="../babel2.json" />
// Transpile JSX in a block and provide missing type information:
__jsx {
var myDiv =
<div>
<div>
<span>
</span>
</div>
</div>;
} // JSX is told to read until "}"
declare var myDiv: ReactElement;
// Or do the exact same thing with the "__transpiler var" sugar:
__jsx var myDiv: ReactElement =
<div>
<div>
<span>
</span>
</div>
</div>; // JSX is told to read until ";"
// Or:
var myDiv: ReactElement;
__jsx {
myDiv =
<div>
<div>
<span>
</span>
</div>
</div>;
} // JSX is told to read until "}"
// Or with sugar:
var myDiv: ReactElement;
__jsx myDiv =
<div>
<div>
<span>
</span>
</div>
</div>; // JSX is told to read until ";"
// Import a file with "__transpiler import" after transpiling it with Babel:
__babel import {Foo} from 'foo.js'; // Babel is told to read file "foo.js"
// Note that the previous line doesn't do this:
__babel {
import {Foo} from 'foo.js';
} // Babel is told to read until "}"
// Babel does not get that "import" line, it gets the content of the file. |
Arbitrary embedded DSLs are completely out of scope for TypeScript. As for JSX, the proposal here doesn't even solve the problem of using JSX and TypeScript. Nobody is going to consider code like I get that we don't want to "pick winners" in terms of frameworks, but JSX really is a special case. It has wide support and strong adoption, solves a readily-identified problem in the JavaScript world, occupies a small and reasonable part of the grammar, and is one of our biggest adoption blockers ("I want to use TS, but we use JSX"). |
For starter, I'm not sure if judging a super rare case like the escaping example is fair. But anyway:
I just discovered recently that it's actually possible to have a clean support for JSX.
This proposal was having tooling support?
The thing is only a few companies and people have the resources to create something like JSX. I just think that this proposal will create an innovation platform for those who want to create the next web framework and many of them will be inspired by JSX. Though writing a compiler from scratch plus the extended syntax might prove to be too daunting for people. With the proposed embed, people don't need to write a compiler from scratch, just the extension. I could create a POC of this but only if it got a chance(even the slightest) to be landed. |
😄 I am with you in wanting to put flow out of business (it is costly to hire, train, and maintain staff requiring different technical skills). But let's not get carried away and claim that there is something special about JSX/React: the competing frameworks have even wider support and stronger adoption. |
Having native React support will immediately move it to the top of my UI tech stack preference 😄 |
Can you be more specific? Which of these extend JS syntax? |
Knockout for example defines its logic in the |
You also forgot to mention all template languages JSX is just a new kid on the block fed with a silver spoon. |
I am okay with the TS team adding native JSX support specific to React 🌹. Still curious about non react implementations though e.g. https://github.com/vjeux/jsxdom this is a fundamental motivation behind the jsx spec : https://facebook.github.io/jsx/#transpilers < is this out of scope as well or needs a different proposal? |
Also notice that Handlebars has 10x more downloads than React on NPM: Charts for Handlebars: |
Not that I am concerned about this specific conversations, but making architectural decisions based on the number of downloads somethings has on NPM is quite ill advised logic. |
I definitely agree with you, but wasn't it the numbers that made the TS team include JSX in the first place? And just if I can speculate(which is bad) added the postfix type assertion operator |
@tinganho worth your review : TypeStrong/atom-typescript#362 |
@basarat Atom-typescript is a great plugin. But I think this feature should be implemented in TS for various reason, one of them being that not everyone is using the same IDE. Another reason is that this problem belongs to the TS compiler, so it would be best if they could fix it. @RyanCavanaugh how about instead of an embed having pluggable syntax support? Acorn have already re-written its parser to support syntax plugins. The current TS JSX implementation could then be rewritten to use it? Though it would require some hooks to be implemented in the scanner, parser emitter checker etc. It looks very nice in their example code: Relevant info: Relevant discussion: |
I'm thinking:
Making the need for any escaping very unlikely. This is very much like |
@basarat, I like what you are trying to do. |
Here is another syntax that turned up in my research, Rust uses I still prefer my stronger delimiters |
@basarat turns out that Rust already has an syntax extension API https://doc.rust-lang.org/book/compiler-plugins.html. |
@tinganho Yup. That's what I meant with "Rust uses |
I have watched this PR #2673 and this #296 discussion for a while. But I somehow don't think it is the right approach to accept
JSX
directly intoTS
. There seem to be a demand of embedding HTML and even CSS in JS. Facebook might be the pioneer in embedding HTML in JS and therefore gotten a lot of attention recently withJSX/React
. IfJSX
got accepted intoTS
, it would probably send a clear message to the JS and TS community that the TS team thinks JSX is the right way(and the only way) to go in terms of rendering views. I think it is more a correct way to accept an "embedded syntax block", so different frameworks have a chance to compete with JSX. It is a more generalized approach as oppose to accepting JSX directly into the TS.Also remember that @ahejlsberg said "A framework is created nearly every day". So who knows if React will survive the next 2 years? Just remember how fast
Angular 1
was popular and very quickly perceived as outdated(roughly a year?). That's how fast the web moves today.So in rough terms here is my thoughts on this "embedded syntax block". I will take
JSX
as an example:First, we refer to a JSX to TS transpiler. So we can transpile JSX to TS in our source code:
///<transpiler name="jsx" path="../node_modules/jsx-ts-transpiler/transpiler.ts"/>
Now, we define a JSX block we want transpiled into TS using the syntax
syntax 'jsx' {...}
:The TS's
scanner
stops atsyntax 'jsx' {
and feeds the string that reaches the closing brace}
of the syntax block to the JSX-TS transpiler in this case:<div>Hello {this.props.name}</div>
. The JSX-TS transpiler emits the correct TS code back along with a source map so we can map any lines and columns in the emitted TS back to our JSX. Here is how the transpiler looks like:Now the TS scanner goes ahead an parse it as if it was a normal TS file.
Now, we want all AST nodes outside of any "embedded syntax block" to have the same positions(
pos
andend
) as if they where without any line and column shifts. So that we can have the correct positions on errors and warnings autocomplete etc. This is easily done by just offsetting a node'spos
andend
by keeping track of how much the emitted JSX-TS output shifts in character positions. For the emitted AST nodes we would marked them coming from a particular embedded syntax block(the embedded syntax block itself is an AST node withpos
andend
). The emitted TS nodes will be assignedpos
andend
beginning from0
and not the start of its' syntax block.Now, we have all the information we need to output the correct positions in source map and diagnostics. The diagnostics and the source map needs to be rewritten to take into consideration the new embedded syntax blocks along with its' emitted TS code.
I think this is a more open approach. If you accepts JSX directly into TS, then you are practically closing the door for any future syntax embeds and letting Facebook decide the future of syntax embeds.
Updates 1:
Since the TS scanner can't decide where a syntax embed block ends we would need the transpiler to help us so here is the updated transpiler:
The TS
scanner
pushes characters to our transpiler withpushChar()
and receives true or false, whether to proceed with pushing characters to our transpiler or get the transpiled output withtranspile()
.The syntax for defining syntax embed could drop the
syntax
keyword:The text was updated successfully, but these errors were encountered: