You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
➜ aiac dockerfile for nodejs with comments
```Dockerfile# Use the official Node.js 14 image as a parent imageFROM node:14# Set the working directory in the containerWORKDIR /usr/src/app# Copy package.json and package-lock.json (if available) to the working directoryCOPY package*.json ./# Install project dependenciesRUN npm install# Copy the rest of the application code to the working directory[S/s]: save and exit[W/w]: save and chat[C/c]: continue chatting[R/r]: retry same prompt[Y/y]: copy to clipboard[Q/q]: quitChoice: c✔ New message: continue█# Copy the rest of the application code to the working directoryCOPY ..# Build the application if necessary (e.g., if using a build step)RUN npm run build# Expose the port the app runs onEXPOSE 3000# Define the command to run the appCMD ["npm", "start"]This Dockerfile is designed for a Node.js application that listens on[S/s]: save and exit[W/w]: save and chat[C/c]: continue chatting[R/r]: retry same prompt[Y/y]: copy to clipboard[Q/q]: quitChoice: c✔ New message: continue█# Define the command to run the appCMD ["npm", "start"]# Add a health check to ensure the application is running properlyHEALTHCHECK --interval=30s --timeout=30s --start-period=5s --retries=3 \ CMD curl -f http://localhost:3000/ ||exit 1# Optional: Set up a volume for persistent dataVOLUME [[S/s]: save and exit[W/w]: save and chat[C/c]: continue chatting[R/r]: retry same prompt[Y/y]: copy to clipboard[Q/q]: quit✔ Choice: q█
The endpoint provides information about the ma_tokens, but I guess they are not used?
Is it somehow possible to add/modify the max_new_tokens parameter?
Like curl -N https://deepseekv2.mycluster/generate -X POST -d '{"inputs":"dockerfile for nodejs with comments?","parameters":{"max_new_tokens":200}}' -H 'Content-Type: application/json'
The text was updated successfully, but these errors were encountered:
I'm not familiar with Huggingface, but I see that it implements the OpenAI API (although your examples seem to be its own API?) with a relatively small default for "max_tokens". I suppose we can expose other parameters too, do you think it would make sense though to add something like max_tokens to the backend configuration, rather than doing this on a per-request basis (e.g. as a flag in the CLI or a parameter in the library)? Which would make more sense for your use case?
➜ aiac --version
aiac version 5.2.1
We are using local backends provided by huggingface TGI
Invoking aiac results in a truncated response
The endpoint provides information about the ma_tokens, but I guess they are not used?
Is it somehow possible to add/modify the
max_new_tokens
parameter?Like
curl -N https://deepseekv2.mycluster/generate -X POST -d '{"inputs":"dockerfile for nodejs with comments?","parameters":{"max_new_tokens":200}}' -H 'Content-Type: application/json'
The text was updated successfully, but these errors were encountered: