[Feat](udf) Support Python UDF/UDAF/UDTF for Doris #3274
base: master
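| :::tip Tip | ||
| **Environment dependencies**: Before using Python UDFs, you must install the **`pandas`** and **`pyarrow`** libraries in the Python environment on every BE node. Both are mandatory dependencies of the Doris Python UDF feature. See [Python UDF environment configuration](python-user-defined-function#python-udfudafudtf-环境配置与多版本管理). | ||
|
| **Log path**: The Python UDF Server writes its runtime log to `output/be/lib/udf/python/python_udf_output.log`. Use this log to check the Python Server's status, review function execution information, and debug errors. |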
Change this log path; logs should all be placed under the log directory.
| ) | ||
| AS $$ | ||
| class ClassName: | ||
| def __init__(self): |
`reset` also calls this; the docs need to state that explicitly.
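To illustrate why this matters, here is a minimal sketch in plain Python. It is not Doris's actual runtime: the `accumulate` and `finish` method names and the `run_groups` driver are hypothetical, and the reset-by-`__init__` behaviour is assumed from the comment above. If a reset re-invokes `__init__`, all per-group state has to be initialised there, or it will leak from one group into the next.

```python
class CountState:
    """Hypothetical aggregate state; method names are illustrative only."""

    def __init__(self):
        # All mutable state is (re)created here, so re-running __init__
        # returns the object to a clean slate.
        self.count = 0

    def accumulate(self, value):
        # Count only non-NULL inputs.
        if value is not None:
            self.count += 1

    def finish(self):
        return self.count


def run_groups(groups):
    """Hypothetical driver: reuses one object and resets it via __init__."""
    state = CountState()
    results = []
    for rows in groups:
        state.__init__()  # assumed reset behaviour described in the review
        for row in rows:
            state.accumulate(row)
        results.append(state.finish())
    return results
```

With this sketch, `run_groups([[1, None, 3], [4], []])` yields one count per group, because each group starts from the state that `__init__` sets up.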
| count += 1 | ||
| ``` | ||
|
| #### Q6: How do I use third-party libraries in a Python UDTF? |
This part should be split out into its own section; there is no need to cover it separately under UDTF.
|
| **Solution**: | ||
| 1. Check the Python version and dependency versions on all nodes | ||
| 2. Deploy the environment consistently from a `requirements.txt` or `environment.yml` file |
The docs need to explain how to use these two files, or link to an explanation.
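For step 1 of the solution, a small script along these lines could be run on each node and the outputs compared. This is a sketch under assumptions: the `environment_report` and `find_mismatches` helpers are hypothetical names, it relies only on the standard-library `importlib.metadata`, and it has to be run with the same interpreter that the BE uses for Python UDFs. It does not cover how to deploy from `requirements.txt` or `environment.yml`.

```python
import sys
from importlib import metadata

# Packages the documentation lists as mandatory for Python UDFs.
REQUIRED = ("pandas", "pyarrow")


def environment_report(packages=REQUIRED):
    """Return the interpreter version and the installed version of each package."""
    report = {"python": "{}.{}.{}".format(*sys.version_info[:3])}
    for name in packages:
        try:
            report[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            report[name] = None  # not installed in this environment
    return report


def find_mismatches(reports):
    """Given {node_name: report}, return the keys whose values differ across nodes."""
    keys = set()
    for report in reports.values():
        keys.update(report)
    return sorted(
        key
        for key in keys
        if len({report.get(key) for report in reports.values()}) > 1
    )


if __name__ == "__main__":
    print(environment_report())
```

Collecting the printed reports from every node and passing them to `find_mismatches` gives the list of entries (Python itself or a package) that are not uniform across the cluster.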