Unify the usage of `path` parameter in functions #977

Lesik · 2020-03-26T15:29:07Z

The path parameter is used in multiple functions that Zola adds to the Tera templating engine, like get_url() and resize_image(), as well as in Markdown links like [link](path). However, the way that they resolve the path seems to differ between them.

For example, get_url() seems to do a lookup in static/ for paths that have no leading / or @, and for paths that start with an @ it does a lookup in content/. On the other hand, both get_page() and get_section() start in content/ for paths without leading / or @. It is unclear from the documentation what happens with paths with a leading / in any of these functions.

| function | `mypath` | `@mypath` | `/mypath` |
| --- | --- | --- | --- |
| `[link](mypath)` | `$PWD/mypath` | `content/mypath` | ? |
| `get_url()` | `static/mypath` | `content/mypath` | ? |
| `get_page()` | `content/mypath` | ? | ? |
| `get_section()` | `content/mypath` | ? | ? |
| `resize_image()` | `content/mypath` | ? | ? |

I propose the unification of the path parameter in some or all of those cases. Let's discuss!


Lesik · 2020-03-26T15:34:41Z

By the way, I've stumbled upon this while creating a shortcode that resizes images. I wanted to link to the original image as well as the resized one, so both get_url() and resize_image() came into play. But since the path parameters are resolved differently, I couldn't just pass the same path to both functions, which I thought was pretty confusing.

Keats · 2020-03-26T15:48:05Z

Related: #911

Keats · 2020-06-09T20:35:32Z

Can anyone spend some time to figure out a nice way to unify them? No need to actually implement it, just think things through.

#1044 took the approach of actually allowing basic path and @ path.

vojtechkral · 2020-06-18T21:31:30Z

I ran into this issue - and #788 - during refactoring of the imageproc code. I'm not yet sure how to solve this but I would like to make resize_image() consistent with the others as part of the refactor... I'll try to think of something...

Keats · 2020-06-19T10:11:51Z

Yep, hopefully the next version will have fully consistent paths, I'm thinking about it too.

Lesik · 2020-06-19T11:16:01Z

So I think the easiest one to tackle is resize_image(), as its purpose and use are pretty clear while the other functions can be used for all sorts of things. In my opinion, resize_image()'s path parameter should work exactly like markdown's ![alt](path), since both are for referencing images and that way it's more intuitive. So for paths starting with a normal letter (like team.jpg) it should do a lookup under content/$current_path$/team.jpg. However, we need some way to reference images in static/. For this we could use the / prefix: while a little confusing, it makes sense when you consider that all content in static/ will end up under / after running zola.

content
└── about
    ├── imprint.png
    ├── index.md
    └── team.jpg

I think the same should apply for get_section() except that / looks under content/ instead of static/.
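To make the proposal above concrete, here is a minimal sketch of the suggested rule. It is not Zola's actual code: the function name `resolve_proposed` and its parameters are hypothetical, and `slash_dir` is an assumption used to express that a leading `/` would map to `static/` for resize_image() but to `content/` for get_section().

```rust
use std::path::{Path, PathBuf};

/// Hypothetical helper illustrating the proposal above; not Zola's actual code.
/// `slash_dir` is the directory a leading `/` maps to: "static" for
/// `resize_image()`, "content" for `get_section()`.
fn resolve_proposed(site_root: &Path, current_dir: &str, path: &str, slash_dir: &str) -> PathBuf {
    match path.strip_prefix('/') {
        // Leading `/`: resolve from the given top-level directory.
        Some(rest) => site_root.join(slash_dir).join(rest),
        // No prefix: resolve next to the current page under `content/`.
        None => site_root.join("content").join(current_dir).join(path),
    }
}
```

With the tree above, `resolve_proposed(Path::new("site"), "about", "team.jpg", "static")` would point at `site/content/about/team.jpg`, while a path such as `/logo.png` would be looked up under `site/static/`.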

vojtechkral · 2020-06-19T13:37:02Z

> So for paths starting with a normal letter (like team.jpg) it should do a lookup under content/$current_path$/team.jpg. However, we need some way to reference images in static/. For this we could use the / prefix: while a little confusing, it makes sense when you consider that all content in static/ will end up under / after running zola.

Ha, incidentally, I already have this implemented in my local refactor :-) However, I'm not entirely sure this won't make matters more confusing. Also, there's still the issue with #788 which I'm not sure how to solve such that the overall situation is consistent.

vojtechkral · 2020-06-19T13:42:06Z

Also, and this is a bit of a nit, the @path is actually @/path if I'm not mistaken...

Lesik · 2020-06-19T13:48:24Z

Afterwards, we should also address whether or not any or all of these functions should error out if the resource is not found.

Keats · 2020-06-19T15:05:15Z

Another issue with paths: #877 but in that case from a function output

Keats · 2020-06-19T15:12:42Z

And #1035 is roughly the same

vojtechkral · 2020-06-19T17:15:07Z

#1035 is a dupe of #788 IMO

Keats · 2020-06-27T12:36:36Z

Another type of paths that could be added once we resolve the existing ones: https://zola.discourse.group/t/experience-report-after-porting-my-blog/478/3?u=keats

I.e. a shortcut for when you have a lot of nesting and unique filenames. Not for the next version though, let's first make sure that:

- it makes sense
- all paths are consistent

Keats · 2020-07-24T20:31:02Z

In #1086 I've decided that current_path/page.path/section.path etc will have a leading /, so if you do window.location.pathname or something it matches. Pretty much every server interpretation of a path has a leading / as well.

Another issue is that some people want to get the URL for nested assets (#1098) but I am not sure about that.

I don't think I want to force people to use leading / in functions though so I think with/without leading / should be the same as it would be confusing to be different imo.
How about $/ to refer to static folder? It's not that common so it makes sense to add a sigil imo.
get_url is tricky as people would expect path without @/ to get the correct url for markdown files if we move it to PWD...

radio-alice · 2020-07-26T19:42:11Z

Hey y'all no idea if this is related, but I'm building an rss reader and noticing that Zola-generated rss.xml files use relative links for images, meaning that they can't be viewed in the reader without me overriding the post's html. I assume the issue stems from this one, so just wanted to chime in to push for defaulting to absolute urls for (processed?) images in rss templates. Sorry if this is the wrong place to report this! Also sorry if this has been fixed, and the feeds I'm looking at were generated by an older version– I couldn't get zola to build an rss file at all, so couldn't check if it's still happening. Feel free to delete if totally irrelevant.

Keats · 2020-07-27T09:01:46Z

At least on my site (built with 0.11, https://www.vincentprouillet.com/rss.xml) some images are absolute and some are relative. I will check why later, that's a bit odd.

southerntofu · 2020-07-27T16:25:51Z

mypath and /mypath are HTML standard for relative URL, respectively to the current page and to the webroot. These are established standards and i believe we should not try to change their meaning.

From what i understand @/mypath was implemented to reference Markdown pages from the content directory. But it just so happens it produces links relative to the website's root, which is very useful as it may be distinct from the webroot.

Links relative to the website's root are also what we want when referencing stuff in the static folder. Couldn't we use this symbol for both cases, and error when an @/link points to a missing resource (either in content/ or static/)?

Keats · 2020-07-27T17:42:19Z

Ok so in my blog, if I use markdown images it gets the full permalink but I was using a shortcode for clickable images where I was using relative links, I've just changed it and now all my image links are absolute.

@southerntofu can you expand? Do you mean only allowing @/ or still allowing mypath and /mypath?
I think in all cases, the start of the path should be the website root, and not static like get_url (which really should have been called get_url_for_static since that was the goal and is now confusing)

southerntofu · 2021-01-13T20:39:32Z

@Keats I don't think "/" should reference the site's webroot, otherwise how do you reference the actual virtual host's webroot? In my view, starting from the site's webroot is what the @/ syntax was introduced for, because reusing / for this would contradict how HTML links work.

In my view, we could have something like this in the context of https://WEBROOT/SITEROOT:

- path is a link relative to the current page (like in HTML)
- /path is an absolute link for https://WEBROOT/path (like in HTML)
- @/path is an internal link for https://WEBROOT/SITEROOT/path, which can correspond to a piece of content, a static file, or a CSS file generated from SASS (am i forgetting something?)
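The three link forms above could be sketched roughly as follows. This is only an illustration of the proposal, not anything Zola implements: `resolve_link`, `join_url`, and all parameter names are hypothetical, and `page_dir` is assumed to be the current page's directory relative to the site root.

```rust
/// Joins URL parts with single slashes, skipping empty parts.
fn join_url(parts: &[&str]) -> String {
    parts
        .iter()
        .map(|p| p.trim_matches('/'))
        .filter(|p| !p.is_empty())
        .collect::<Vec<_>>()
        .join("/")
}

/// Hypothetical resolver for the three proposed link forms.
fn resolve_link(webroot: &str, siteroot: &str, page_dir: &str, link: &str) -> String {
    if let Some(rest) = link.strip_prefix("@/") {
        // `@/path`: internal link, relative to the site's root.
        join_url(&[webroot, siteroot, rest])
    } else if let Some(rest) = link.strip_prefix('/') {
        // `/path`: absolute link, relative to the virtual host's webroot.
        join_url(&[webroot, rest])
    } else {
        // `path`: relative to the current page.
        join_url(&[webroot, siteroot, page_dir, link])
    }
}
```

Checking `@/` before `/` matters only for readability here, since the two prefixes cannot both match the start of the same string.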

Two caveats:

- pages residing in the same folder as a section don't have access to the same assets, whereas relative links in HTML work on a per-folder basis (counter-intuitive for people from a web background)
- relative and internal links should be localized when possible, so that translating an asset like an image (or anything else, really) only requires adding a corresponding asset.LANG.ext

For the first problem, i had written a simple patch which was not merged for good reasons: assets should not be copied many times in many output folders. An alternative way is by generating symlinks instead of copying the files: this would make me happy, but would not be portable to all platforms. Another way is in the path resolution process (in markdown.rs), but then it means we need to expose assets/parenthood information to this rendering process somehow? Yet another way is #840, by having pages exported to the same folder as the section, all pages in a section can link to the same assets, but this does not solve the problem for people who do not want "ugly URLs".

For the second problem, there is one way i mentioned before for templates by exposing a has_page function which enables a theme author to make macros to deal with localization of content in a flexible manner. However, this approach would not enable localization of Markdown links (except through shortcodes), and places the burden of translations support on the theme authors.

Maybe unified path resolution could address both caveats? However, a concern is what to do when there is a missing piece of content or translation? Currently, zola will not complain for "wrong" relative URLs in a page. That's good on one hand because assets can be dynamically generated outside of zola's reach, placed in the right folder, and will become available after site generation (that's how relative URLs work in HTML). On the other hand it's bad because it does not catch typos in assets names (unless you use a full internal link).

Maybe different strategies in how to deal with failed resolution apply to different situations? If we unify path resolution, we'll make a lot of things easier, but we still need control over whether a missing translation or a missing piece of content should produce an error, a warning, and/or have a fallback.

For markdown resolution, this could be inherited from a section and be overwritten on a per-page basis. Something like missing_content_error (error, warn, none) and missing_translation_error (error, warn, none). In the templates, maybe all functions dealing with path could have similar parameters, defaulting to parameters from site config?
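A minimal sketch of how such a setting could cascade, assuming the three levels named above (site config, section, page). Everything here is hypothetical: the type `MissingPolicy`, the functions, and the message format are made up for illustration, and the `Ignore` variant stands for the "none" value in the proposal.

```rust
/// Hypothetical policy for a failed path resolution.
#[derive(Debug, Clone, Copy, PartialEq)]
enum MissingPolicy {
    Error,
    Warn,
    /// Corresponds to "none" in the proposal: stay silent.
    Ignore,
}

/// The page setting overrides the section setting, which overrides the site default.
fn effective_policy(
    site: MissingPolicy,
    section: Option<MissingPolicy>,
    page: Option<MissingPolicy>,
) -> MissingPolicy {
    page.or(section).unwrap_or(site)
}

/// Returns an error, an optional warning message, or nothing at all.
fn handle_missing(policy: MissingPolicy, path: &str) -> Result<Option<String>, String> {
    match policy {
        MissingPolicy::Error => Err(format!("missing content: {}", path)),
        MissingPolicy::Warn => Ok(Some(format!("warning: missing content: {}", path))),
        MissingPolicy::Ignore => Ok(None),
    }
}
```

The same shape would work for missing_translation_error; template functions could take the policy as an optional argument and fall back to `effective_policy`.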

Additionally, if templates can silently detect failed resolution, it can introduce new usecases like:

- displaying "missing translation" messages with links to where a user can contribute translations for their language for specific pieces of content (maybe a new banner?)
- displaying "missing data" messages where we use load_data, with instructions how to generate the data (could be useful for a test result page like here)

What do you think?

Keats · 2021-01-14T20:59:55Z

I think it makes a lot of sense, but I need to think more about the i18n cases and it can be a bit separated from this issue.

I think the main issue here is get_url but I don't know if the path to an asset in static should start with @/ because it refers to link to content only in markdown (where we still use relative links for static assets). I haven't given much thoughts to this issue recently, I'll need to spend some time thinking about it.

Keats · 2021-05-10T19:47:36Z

Ok time to actually work on that so it finally makes sense and to handle i18n better. How does that look to everyone?

| function | `mypath` | `@/mypath` | `/mypath` |
| --- | --- | --- | --- |
| `[link](mypath)` | `$PWD/mypath` | `content/mypath` | `$BASE_URL/mypath` |
| `get_url()` | [`$PWD/mypath`, `content/mypath`] | `content/mypath` | `$BASE_URL/mypath` |
| `get_file_hash()` | [`$PWD/mypath`, `content/mypath`] | `content/mypath` | N/A |
| `resize_image()` | [`$PWD/mypath`, `content/mypath`] | `content/mypath` | N/A |
| `get_image_metadata()` | [`$PWD/mypath`, `content/mypath`] | `content/mypath` | N/A |
| `load_data()` | [`$PWD/mypath`, `content/mypath`] | `content/mypath` | N/A |
| `get_page()` | `content/mypath` | `content/mypath` | N/A |
| `get_section()` | `content/mypath` | `content/mypath` | N/A |

The idea is that it should be easy to:

- address any content that currently doesn't work (for example get_url for a feed with a specific language)
- use any combination in the templates so we don't need crazy workarounds like the one in "Can't get directory path to collocated assets when using YYYY-MM-DD_ page prefix" #788 (comment)

Next steps are to write some tests for every issue mentioned and see if the above would solve them.
get_page/get_section can only be used to refer to something in content/ so whether or not we are putting the @ we would still look in content. This might make some things easier when building paths in templates.
It is a breaking change for get_url as people will need to do static/my.js instead of my.js and it should also work on colocated assets somehow.

[$PWD/mypath, content/mypath] means we are actually checking from both places in that order.

I'm also a bit unsure on where to allow /mypath.

Any thoughts?

Keats · 2021-05-12T19:08:06Z

I've started work on #1455

Keats · 2021-05-17T18:26:19Z

The state of #1455 which I will merge soon:

- more tests (🎊)
- some functions have been marked as safe (eg, no need anymore to do get_url() | safe, just get_url() will output the same string)
- resize_image now returns both the new image URL as well as its path in the static directory, allowing it to be chained with get_image_metadata

Now for the paths themselves, all the functions working on files (get_url, get_file_hash, resize_image, get_image_metadata, load_data) will now use the following logic in this order:

/// 1. base_path + path
/// 2. base_path + static + path
/// 3. base_path + content + path
/// A path starting with @/ will replace it with `content/`

In the PR state, this means that pretty much anything starting with / will not work.
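The lookup order quoted above could look roughly like this. It is a sketch under assumptions, not the code from the PR: the name `locate_file` is hypothetical, and the `exists` predicate is injected so the example runs without touching the disk (a real implementation would presumably check the filesystem).

```rust
use std::path::{Path, PathBuf};

/// Hypothetical sketch of the three-step lookup described above.
/// Returns the first candidate for which `exists` reports true.
fn locate_file(base_path: &Path, path: &str, exists: impl Fn(&Path) -> bool) -> Option<PathBuf> {
    // A path starting with `@/` is rewritten to start with `content/`.
    let rewritten = match path.strip_prefix("@/") {
        Some(rest) => format!("content/{}", rest),
        None => path.to_string(),
    };
    let candidates = vec![
        // 1. base_path + path
        base_path.join(&rewritten),
        // 2. base_path + static + path
        base_path.join("static").join(&rewritten),
        // 3. base_path + content + path
        base_path.join("content").join(&rewritten),
    ];
    candidates.into_iter().find(|p| exists(p.as_path()))
}
```

One detail worth knowing when building paths this way in Rust: `Path::join` with an absolute argument discards the base, so in a scheme like this sketch every candidate for `/mypath` would collapse to `/mypath` itself. That is one way a leading `/` can fail to resolve.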

A few questions remain:

- page.path and section.path have a leading / so just concatenating things to it would fail. Until now, get_url was stripping leading / from the given path, should I add a rule doing that in all functions?
- get_url will still end up doing a best guess for some things: for example calling it in the index with a path to a colocated asset will give you back an invalid link since it doesn't look around to check if it's a colocated asset. I could add that but is it worth it?
- Calling get_image_metadata on a resized image will fail the first time, since images are processed after the templates. I've added an allow_missing argument that a user can use to ignore file not found as a stop gap solution - I'm not sure what is the best thing to do there. cc @vojtechkral

What do people think?

vojtechkral · 2021-05-17T19:12:49Z

> Calling get_image_metadata on a resized image will fail the first time, since images are processed after the templates. I've added an allow_missing argument that a user can use to ignore file not found as a stop gap solution - I'm not sure what is the best thing to do there. cc @vojtechkral

Good question, don't know exactly what would be best right now either. I've pulled your branch and I'm having a look. Will try to figure out something...

vojtechkral · 2021-05-17T19:20:45Z

@Keats How about we make both resize_image() and get_image_metadata() return the same kind of object: {url, static_path, width, height}? resize_image() can load the metadata right away just like get_image_metadata() and compute the dimensions; this shouldn't slow it down too much. The actual resize can still be done later in a deduped and parallel fashion. Does that sound ok?

Edit: So many typos, sorry

Edit2: I am willing to implement this, I've been procrastinating on the imageproc cleanup long enough and I've just finished some other stuff, so I should be able to get around to it finally...

Keats · 2021-05-17T20:02:44Z

That's a good idea! If you can do the PR on #1455 it would be great

Edit: wait, how would we know the sizes with something like a fit where you only pass the width?

vojtechkral · 2021-05-17T20:30:13Z

> Edit: wait, how would we know the sizes with something like a fit where you only pass the width?

I think that should be computable from the input width and image metadata...
Edit: Actually, having the same type for resize_image() and get_image_metadata() doesn't help, it's quite different, but the idea of computing the size beforehand is ok I think.
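For the width-only case, the output size follows from the source dimensions by keeping the aspect ratio. A minimal sketch, assuming round-to-nearest and a minimum height of one pixel; the function name `fit_width_size` is hypothetical and the rounding used by the actual image code may differ.

```rust
/// Hypothetical helper: output size for a resize that only fixes the width,
/// preserving the source aspect ratio. Returns `None` for a zero-width source.
fn fit_width_size(orig_w: u32, orig_h: u32, target_w: u32) -> Option<(u32, u32)> {
    if orig_w == 0 {
        return None;
    }
    // Widen to u64 so the intermediate product cannot overflow,
    // and add half the divisor to round to the nearest integer.
    let scaled = (orig_h as u64 * target_w as u64 + orig_w as u64 / 2) / orig_w as u64;
    // Never report a zero-pixel height.
    Some((target_w, scaled.max(1) as u32))
}
```

Because only the source metadata is needed, the dimensions can be reported at template time while the actual resize is deferred.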

vojtechkral · 2021-05-20T21:30:10Z

@Keats FWIW, I'm working on it, about half way through, will try to report back asap with something...

Edit: WIP commits in my paths-vojtech branch. Will yet need to test a bit more, clean it up a bit, add docs etc.

Keats · 2021-05-21T07:57:30Z

Cheers, take your time we're not in a hurry. What do you think of the other functions change? Does it make more sense now?

vojtechkral · 2021-05-23T20:58:36Z

@Keats I've created a PR at #1484. The changes aren't documented yet, I suppose I would update the doc after we figure what the end result is going to be.

I think overall the change makes sense as far as I understand. Does the $PWD in the table refer to the directory of the current MD or HTML file? This isn't yet implemented, right? Or maybe I misunderstand...

Keats · 2021-05-24T06:26:24Z

Look at the comment in #977 (comment) rather than the table, it should be clearer. I'll have a look at the PR.

adworacz · 2021-05-29T00:48:18Z

Does this new implementation address the use case presented in #1161 around get_image_metadata and colocated assets?

Specifically, being able to use a shortcode and use it in Markdown like so:

Markdown - content/example/index.md:

{{ image(src="image-example.png") }}

Template - templates/shortcodes/image.html:

{% set meta = get_image_metadata(path=src) %}
<img src="{{src | safe}}" width="{{meta.width}}" height="{{meta.height}}" />

Or do we still need to use the page.path variable and strip the leading slash?

Keats · 2021-05-30T08:28:47Z

> do we still need to use the page.path variable and strip the leading slash?

You will still need to use the page.path variable but I will make it so you don't have to strip the leading slash.
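The slash handling discussed here amounts to normalizing the page path before appending the asset name. A small sketch of that idea, with a hypothetical function name; whether the resulting string resolves depends on the lookup rules of the function it is passed to.

```rust
/// Hypothetical helper: joins a `page.path`-style value (which has a leading
/// and usually a trailing `/`) with the file name of a colocated asset.
fn colocated_asset_path(page_path: &str, asset: &str) -> String {
    // Drop leading and trailing slashes so simple concatenation is safe.
    let dir = page_path.trim_matches('/');
    if dir.is_empty() {
        asset.to_string()
    } else {
        format!("{}/{}", dir, asset)
    }
}
```

For the example above, a page path of `/example/` and `image-example.png` would combine to `example/image-example.png`.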

Keats · 2021-06-10T08:56:43Z

It's all available in the next branch now, thanks to @vojtechkral for the improvements on the image handling. All functions operating on files now use the same helper function to get the path so it can't be more unified than that!

Keats · 2021-06-11T21:36:27Z

@vojtechkral any reason we are not caching get_image_metadata response? I can add that if it's ok

vojtechkral · 2021-06-13T18:00:23Z

@Keats Missed the notification, sry. Anyway, no reason, agreed it would be a good idea to cache it. Not sure if I can get to it sooner than next weekend :/

Keats · 2021-06-14T07:38:20Z

Ah I didn't mean you need to be the one to do it, I can do it myself! Just wanted to double-check it makes sense
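Caching the metadata lookup could be as simple as memoizing by path. The sketch below is an assumption about the general shape, not the implementation that was added: `ImageMeta`, `MetaCache`, and the injected `load` closure are hypothetical, and a version used from several threads would need a lock around the map.

```rust
use std::collections::HashMap;

/// Hypothetical subset of image metadata.
#[derive(Debug, Clone, PartialEq)]
struct ImageMeta {
    width: u32,
    height: u32,
}

/// Memoizes a metadata loader by path. `loads` counts real (uncached) loads.
struct MetaCache<F: Fn(&str) -> ImageMeta> {
    load: F,
    entries: HashMap<String, ImageMeta>,
    loads: usize,
}

impl<F: Fn(&str) -> ImageMeta> MetaCache<F> {
    fn new(load: F) -> Self {
        MetaCache { load, entries: HashMap::new(), loads: 0 }
    }

    fn get(&mut self, path: &str) -> ImageMeta {
        // Serve repeated requests from the map.
        if let Some(meta) = self.entries.get(path) {
            return meta.clone();
        }
        // First request for this path: load once and remember the result.
        let meta = (self.load)(path);
        self.loads += 1;
        self.entries.insert(path.to_string(), meta.clone());
        meta
    }
}
```

A cache like this assumes the file does not change during a build; a rebuild in serve mode would need to invalidate the affected entries.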

tusamni · 2021-07-07T01:40:29Z

Any idea when the next branch will be merged?

I'm trying to access images in the static folder using get_image_metadata and struggling to figure out the correct path.

Keats · 2021-07-07T09:16:49Z

Maybe next week? Just trying to find bugs and fix them before release. You can try the next branch and report if everything works for you there!

Keats · 2021-07-19T19:02:41Z

Should be fixed in 0.14 which is now released! Thanks all for the feedback and please open issues if you encounter bugs or weird things.

Keats added enhancement Feedback wanted labels Mar 26, 2020

Infides mentioned this issue May 24, 2020

Bug with directory prefix and the get_url() and resize_image() functions #1035

Closed

seritools mentioned this issue May 24, 2020

Add a function to consistently resolve link URLs in templates the same way as in markown #1037

Closed

Keats mentioned this issue Jul 2, 2020

Can't get directory path to collocated assets when using YYYY-MM-DD_ page prefix #788

Closed

Keats mentioned this issue Jul 27, 2020

Inconsistent current_path #1086

Closed

Keats mentioned this issue Sep 8, 2020

get_image_metadata: Cannot find path #1161

Closed

southerntofu mentioned this issue Jan 13, 2021

asset links adaptation to subdirectory #1205

Closed

southerntofu mentioned this issue Feb 22, 2021

Significantly improve the README #1373

Closed

Keats pinned this issue May 11, 2021

Keats closed this as completed Jul 19, 2021

Keats unpinned this issue Jul 19, 2021

southerntofu mentioned this issue Feb 27, 2022

Discussion: Reenable link rewriting for colocated assets #1779

Closed
